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(57) Abstract: The invention relates to a method for devel- 
oping a translator and a conesponding system, which transla- 
tor is intended to translate the code of an input language into 
that of a target language. In the method, a descriptive lan- 
guage (V) is used to depict on a semantic level two source 
languages that are independent of each other, that is, the said 
input language (X) and a tai^et language (Y) and conversion 
instructions are prepared using this descriptive language. The 
conesponding system includes the translator and accessory 
files, which convert the input language first of all into the se- 
mantic descriptive language (VX), which is converted to the 
target language converted to the descriptive language (VY), 
fiom which the taiget language code is finally generated. In 
this way, the conversion instruction is as small as possible. 
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METHOD FOR DEVELOPING A TRANSLATOR AND A CORRESPONDING SYSTEM 

The present invention relates to a method for developing a translator and a conresponding system, 
which is intended for converting an input language code into a target language. In the method, a 
5 descriptive language (V) is used to Ibmfially describe two source languages independent of each other, 
i.e. a so-called input language pC) and a so<:alled target language (Y), each source language consisting 
of fomial terms (XI, Y,), vrith one or several occumences in each master temf), and possible parameters 
In these. A program ftamewortc is created for the translator, along with a group of files, v*jch are linked 
together and translated for the selected operating system. 

10 

Every fomial language, including all programming language and fomnal specification languages, can be 
defined with the aid of its clauses, i.e. structures, in such a way that the possible clause types and 
possible sub-clause types of each clause are defined, as well as the external appearance of the 
language (syntax), which contains the Ic^words and defines the fixed order of woids and terms. Each 

16 temri can be a reseived word, such as an indivisible tenti (T), or a conceptual type (C), or a divisible term 
(N). which is a higher component and is constructed from other types. The higher tenfns are called 
starting symbols (8). Each term type has a name, a production (P), i.e. a master temn. The production 
can be either a list (L). a divisible term (N), an indivisible temi (T), or a certain type (C). Formal grammar 
is a combination of all of these (S,P,T,N,C,L). An entire grammar can be formed, starting from the 

20 starting symbols using a typical tree run-through routine, which ends in indivisible terms, which are the 
leaf nodes. 

Translation between languages can be described as follows. The code is read using a scanner 
operation, which divides the code into key temis (token list). A parser operatfon is used to group the key 

25 terms to form a parser tree, which is then parsed to give indivktual temis, for the processing of which 
specific rules are created. In known converskm processes from one language to another, the basic 
tenns, i.e. keywords, are found either from the glossary of the language (reserved words), or from its 
libraries. The statements of a fomnal language are typically divided into consecutive structures, control 
structures, selection structures, and data structures. Consecutive structures are converted directty and 

30 the correspondences for them are developed one clause and term at a time. Control structure 
conversfons Into a higher-level language are simple, as the correspondence can be developed directty, 
or with a small amount of alteratton. Control structure conversion Into a lower-level language demands 
manual examlnatk>n in cases using structures not found in the new language. For example, converting 
C's for-kx)p Into Pascal's for-loop will not succeed as such, if there are operations other than those 

35 relating to bask; incrementation or decrementation in ttie control components (incrementation, tenninal 
component). Thus, it is best to define tiie Pascal conrespondence as, for example, a "REPEAT-UNTIL" 
loop, suitabty supplemented. The selection structures of different languages resemble each another. 
The data structures are made compatible using libraries and defined function calls, or macros. When 
dealing with a dynamte memory, function correspondences are formed between ttie languages. 

40 

The known way to make translators is based on ttie Abstract syntax tree principle (AST), or ttie Object 
Framewori^ principle (OF), or a meta-language. AST contains the target-language grammar, specific 
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program structures, dnd a rule presentation form, by means of which the input language is converted 
into the target language. In the OF principle, objects and classes, in which code for the new language 
and its terms are written, are created to conrespond to the tenms of the input language. In the OPs 
classes there is also code, which performs the parsing operation of the relevant class distributedly. In 
5 the methods refenred to above, the connection between the languages takes place as a syritax tree. 
Known methods are disclosed in, for example, the follovwng patents: EP 0371943, US 5,768,564 and US 
4,980,829 and US 4,729,098, and GB 2327788. All the known translators result in rather cumbersome 
applicattons. When translating the same language to several other languages It has sometimes been 
possible to reduce the amount of input-language processing. In the system according to the 
10 aforementioned EP publication, objects according to the target language are made from the input 
language. Several files containing semantic and code-generator portfons. which the user must form, are 
combined with these. The system then fonns the input file directly into target-language code. 

The present invention is intended to create a simpler method for devetoping a translator and a system 
15 for implementing the translator. Tlie charaderistk: features of the method according to the inventk>n are 
stated in the accompanying Claim 1 and the characteristic features of the conresponding translator 
system are stated in Claim 6. According to the m^hod, the structore of both languages in descrit>ed as 
a grammar, while at the same time a suitable semantic name, corresponding to the syntax structure, is 
chosen for each term. The conversk>n is made on a semantic level, in such a way that inessential syntax 
20 infomnab'on is removed from both source languages, by fomiing descriptive language versions of them. 
At the same time, ttie necessary accessory files for tiie translator can be fornied from the information 
obtained and from the source language, so that the translator devetoped forms a parsing tree in ttie said 
descripthre language, which can now be converted using a minimum of knowledge. Similarly, the 
fonmation of such a conversion instruction for each source-language temn and its presentotion requires 
25 much less work than when using known ti^nslators. The making of the descriptive language verston is 
a straightfonvard operation and preferably takes place already when entering \he grammar. With the sad 
of these operations and of the fonnal original grammars flie accessory files used by the translator are 
created automatically. 

30 The inventfon has the following advantages: 

The application of ttie mettiod between several languages, particulariy from one 
language to several languages, gains ttie conskierable advantage ttiat flie input language 
needs to be converted only once into the descript'rve language. 

35 

The translator itself forms a relatively light program while ttie conversion instruction file 
used by it is ttie smallest possible. As stated above, Uie accessory files Increase ttie 
amount of information to some ^nt, but ttiey are relatively direcHy constoucted lists. 
The mettiod according to ttie invention provides an opportunity to test ttie method by 
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converting the program code translated to the intemnediate langu^e back to the input 
language. 

Instead of a programming language, the source language can also be a documentation 
5 language, particularly the XML format. This permits automatic documentation. The 

source language is a documentation fonriat of a selected fomi. (e.g. module print-out. 
variable list, cross-reference table, graphical hierarchal diagram, data-flow diagram). 

In the following, the invention is examined with the aid of an example, which is shown in the 
10 accompanying figures and in the code print-out The example depicts the development of a translator 
from the simple Minilan language to the C language. The grammar of the IMinilan language is described 
on page 12, the necessary samples of the C language appearing on page 19. 

Figure 1 shows a flow diagram of the method according to the Invention for 

15 developing a translator 

Figure 2 shows a flow diagram of the operation of the translator. 

Rgures 3 and 4 show a graphical interface for interaction connection 

Figure 5 shows the semantic tree structure of the l\/linilan language with links to ttie 

C language. 

20 

The method according to the invention is suitable for developing a translator between two arbitrary 
source languages. In Figure 1 , tiie first source language is an input language X and the second source 
language is a target language Y. In the first stage, their grammars must be stored in a fonnal database, 
stages 1 and 1', which are 'handcraff stages that are independent of each other, but in which the same 

25 predefined naming practice (N) is used to pemnit later automatic processing, which is merited with the 
reference number 100. The implementation of tfie method requires a computer system together witti 
software, which is later called ttie translator developer. In the accompanying examples, one preferred 
format Is shown, according to which reserved words are written in small letters and the terms to be 
divided into parts (defined = non-tenminais) are written in capitels and specific tenns are replaced with 

30 selected signs and words, for example, is "plus". is "eq", is "IT, a dot is "dor In the example 
shown here, ttie input language used is \he concise Minilan language and ttie source language is ttie C 
language. 

In the following stage, a descriptive language version Is fomied, in this case typified for the PROLOG 
35 language, ttiough some other typified, associative, and semantic representation may be envisaged, 
stages 2 and 2\ This takes place automatically, according to preselected mles. The Prolog language is 
described in ttie following publications: "Programming in Prolog". 1984; W.F.CIocks C.S. Mellish (ISBN 
3-540-15011-0 ISBN 0-387-15011-0). Visual Prolog Perusteef ('Foundations of Visual Prolog', in 
Finnish), 1995; Ericki Laitita, (ISBN-952-9823-51-7), "The Art of Prolog- advanced Programming 
40 Techniques"; Leon Steriing. MIT Press 1987 ISBN 0-262-19250-0. It is preferable to use Visual Prolog 
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Developer (Prolog Development Center, DK). in which the necessary inference engine is integrated. The 
manufacturer of the developer has published the title: "PDC Prolog 3.20 Toolbox", DK 1990. 

Though the temis of both languages are converted into the descriptive language according to the same 
5 mles, as groups they sti)l do not conespond to each other at all. Though some of the terms even have 
the same names in the descriptive language, they do not necessary correspond to each other, while 
there are several differences in the use of parameters. In the descriptive language version of the richer 
language, there are many tenons that do not appear in the version in the other language. The making of 
the descriptive language conversion of both requires a simple PROLOG program or similar, the 
10 operation of which is shown clearly in the examples described later. In the examples according to the 
descriptive language, the words written in capital letters are divisible temns. 

Expression in the descriptive language takes place very formally at a semantic level. It has a condse 
syntax relating to the manner of writing the grammar, each structure being given a clear syntactic name. 

15 The relation of the syntax to the ft>rm of the descriptive language is stored in accessory files, which is 
described later in greater detail. Each descriptive language conversion includes simplified information 
on the grammar of the source language. Here, the temn file refers generally either to separate files In a 
mass-memory device, or to a group of records stored in a database. In practical applications, all the 
essential information is stored in a single database, because in most development environments data 

20 processing is then easiest 

At this stage, accessory files 31 and 32 of the descriptive language conversions and the formally stored 
grammars can be formed entirely automatically for the translator (stages 3 and 3'). The conversion 
algorithms are g'rven later. The glossary and scanner tenms, datatypes, parsing logic, generating code, 
25 and format clauses are shown as the accessory files VX(a-e), 31 and conrespondingly VY(a-e), 31' of 
Rgure 1. 

When the divisible temris are broken down into components at the lowest level, the reserved words 
remain to form the glossary for the scanner. The datatypes comprise the original terms and their 
30 hierarchic levels bound to the descriptive language occunrences. 

The parsing logic gives formulae for converting each occurrence from the original code into the 
descriptive language fonri. Here there are generally several hierarchic levels, so that the master terms 
and the subsidiary tenns are each set down In their own code. It is essential that the divisible terms fbmi 
35 internal loops, which must be broken down until no more divisible tenns can be found. Here it is possible 
to use some known high-grade parser logic, for example, the LL{k)-parser, which is a top-down type 
algorithm. 

The generating codes show an essentially opposite process to the parsing logte and the format clauses 
40 are essentially opposite to the scanner operation. If the same source-language scanner, parser. 
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generating, and formatting clauses are applied to the selected target code, the result should be the 
original code. This can be used to check the translator software. 

With reference to Figure 1 , after mnning the accessory files, the interactive connection of the input and 
5 target language descriptive language terms is canied out, stage 4. This is "handcraft", but it preferably 
takes place using a graphical interface 5 and an inference engine exploiting one or several of the 
following simple criteria: 

- the mutual combinafton of prevtously connected occurrences is proposed, 

the mutual combinatton of occunrences/parameters with the same names based on the 
10 descriptive language temns is proposed, 

the mutual combination of parameters on the basis of sequence is proposed. 

The graphical interface includes features supporting translator development a structure editor, whk^h 
recognizes the structures of the input and target languages and pemnits a converston instmction to be 
15 directed to an ever more precise level, by clk^king on the corresponding term 

an interpreter (a sub-group of the inference engine), whteh checks the logicality of the conversion 

instruction: 

each parameter should have at least two uses, expk)itation of locatton and value 
each converston instaictton used that have been refened to must have a definitk>n code 
20 - if the conversion instruction parameters are given names conesponding to their types, it 
will be possible to carry out type-checks for each conversion instmction operation, even 
before the development of the translator 

the interpreter can be used to test the conversion instructions as independent totalities 
prior to the devetopment of the translator. 

25 

Thus, the connection of the tenn VX, of the input language to the temri VY„ converted into the selected 
target language is cam'ed out in the steps: 

- the master terms are connected to each other 

- the occunBnces are adapted to each other. 

30 After this, the conversion instruction (VX^-VY) of each converted input-language tenn (VX,) is stored in 
file 41 for the translator. 

In connection with the completion of the conversion instruction, the part of the target-language 
generating code is retrieved linked to the conversion instrudton tfeadh master term, which is necessary 
35 for the breaking down of each converted master tenn into parte, and is stored In the file 42. 

Now the complete infonnatton is ready for the translation of code in an ari^itrary input language X (the 
source code of a computer program) into a target language Y. A conresponding translator program Is 
made in stage 6, when the translator's applteatton code is translated and the previously created files and 
40 the accessory files required by the operating ^stem are linked to it It is preferable to use the PROLOG 
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language, which has a concise and declarative syntax and is thus pre-eminently suitable as the final 
translatDr code and intemiediate language and as test material. 



The conversion instruction and accessory files can be located to t>e in a running file for the desired 
5 platform, such as, for example, MS-DOS. MS-Windows, or Linux. Operationally, the question is, 
however, of difiierent groups of data, no matter whether they are separate files, in a common database, 
or inside the program to be run. 

The translatton is earned out (Figure 2) by a system, generally a PC apparatus, in which there is 
10 - a translator (X>Y), generally a program connected to an input file 7, which translator 

reads the input file with the aid of a scanner 8 and generates a converted version of the 
computer program, and in which the translator includes conversion instructions VX>*VY, 
41 relating to each input-language tenn, 

a translated file 12 connected to the translator (X>Y) for receiving the translated version 
15 of the computer program thus translated, 

an operations library, in which are the routines to be called from the translator (X>Y). 
The system also includes: 

a first accessoiy file VX(c), 31 c containing the parsing logic of the input language for the 
selected semantic descriptive language (V), 

20 - a second accessory file P(Y(d,e), containing the generafing and fomri^^ 

input language, 
in which case the translator (X>Y) is arranged: 

to convert the lines of code of the computer program, first of all into the fbnm of the 

descriptive language, stage 9, using the parsing logic of the first accessory file VX(c); 
25 - then to convert them, stage 11, in ttie descriptive language form using a so-called 

conversion instruction VX»^VY,41 
- and to generate and format, stage 1 1 , the descriptive language target-language code in 

the fomial code 12 of the target language, using the target language's generating code 

VX(d). 42 and format clauses Y(d), 42. 

30 

In addition, the data types VX(b) and VY(b) of both languages should be avsulable In the various stages. 
Thjsse too are stored for the translator, unless they are already contained in the other data. 



Figures 3 and 4 show the Interactive connection of the graphical interface. In Rgure 3, the first page of 
35 the interface has selection windows 20 and 21 for the proposed terms, selection windows 23 and 24 for 
their occurrences, a selection window 22 for the name of ttie get-clause, a selection window 25 for the 
conversion instruction being formed. In most of the selection windows, the temis can be cliclced from a 
selection list to a pop-up window (not shown). In each such selection window, each component acts as 
a link to a corresponding selection list that appears. 



40 
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In Figure 4, the same reference numbers as above are used for operationally similar selection windows. 
In addition, In this case there are selection windows 23' and 24' for the source-language forms of the 
occurrences and selectton windows 26 and 27 for the occurrences of the term. 



5 CONNECTION OF TERMS. PROCEDURE. Figure 3 

The X-temrt and Y-temn are selected, in this case PROGRAM and STATEMENT 

The Connect button is clicked, if the connection takes place on the level of the master ternis 

(either completely ktentk^l terms, or tte connedk^ 

The Guess button is clicked, If the master tenns are close to each other the devetoper will 
10 then 'guess' all the alternative connections. 

Individual connection is carried out one occumence at a time, in this case COMMANDLIST 
and cs(COMPOUNDSTMNT1). It is canied out through the Define button, using the dialogue 
Terni matching" (Rgure 4). 

The code created can be edited aftevwards in the manual-editing window. 

15 

MATCHING OF TERMS. PROCEDURE, Figure 4: 

- The master temis are selected under EXPRESSION and 6ENEXPRESSION 

- Initially, the occurrence of "add" con^sponding to addition is selected from the input language. The 
developer may propose "math_oper" as the cbnesponding occurrence in the target language, as the 

20 term "+* appears in the syntax of both. 

- The ''code framework^ button is clicked, in whteh case the conversion Instruction header field appears 
together with its parameters in the display. The first parameter Is the data sbucture of the input language 
and the second parameter is the data structure of the target language. The Intentton is to get them to 
correspond to each other, l.e. to create a sennantk; connection between them. Compatibirrty is obtained 

26 by cricking the Y-language parameters so many times that the situatton is reached in whfch the number 
of the parameters of the X and Y-languages are the same. If necessary, standard data is used in the 
new language, or the surplus parameters of the input language are marked as unnecessary (by 
underlining). In this case, the mathematical addition operator of the target language must be selected 
manually. The selection takes place by clicking the word MATH^OPER once, when the developer will 

30 pnssent its sub-aHiematives, which in this case are plus, minus, star, and div. 

- It will now be notfeed that there are the same number of variables In the input and target parameters. 
In the input parametere there are the variables EXPRESSION1 and EXPRESSI0N2, and in the target 
parameters the variables GENEXPRESS10N1 and GENEXPRESSION2. It will further be noticed that 
the terms EXPRESSION and GENEXPRESSION are now being connected together mat this connection 

35 can be exploited recursively to matoh the internal input and target parameters of the tenn. 

- Finally, the button "devetop code" is clfcked, when the developer will create the necessary conversion 
instrucfion by generating the tower-level conversion calls for the oonversbn between the input and target 
tenns. The generation Is now comptete. The final result obteined is a PROLOG-language predicate 
(clause), whfch includes a header section and a condition section, separated by T, i.e. the symbol ">", 

40 The name of the header sectton is formed from the name of the input-language terni, two underlinings 
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and the name of the target-language tenn. The parameters are the input and target-language terms with 
their sut>-parameters. The condition section Includes predicate calls to other predicates, formed to bifeak 
down the sub-parameters. The operation of this predicate is descrit)ed in greater deteil in connection 
with the translation of the example code. 

5 

If the translation is made from a higher-level language to a lower-level language, some temns or only 
some occurrence many remain unconverted. The source code conteined In such componente cannot 
then be translated entirely automatically, instead, such componente must be replaced manually in the 
terget language. 

10 

Figure 5 shows the structure of language X, in this case Minllan. 

The concept at the farthest left-hand side is PROGRAM, which is the language's starting symt)ol, 13. It 
is divided into sub-componente through a list-definition. The list is composed of concepte 14 called 
COMMANDS, which in this case have a semantic name in the fonii assignment, 16, i.e. location, and 

15 loop, 15, l.e. program loop. The concept PROGRAM, 17 may reappear inside the program loop, and 
refers to a short piece of program. In most source languages, these would be separated from each 
other, Le. the storting symbol might be called PROGRAM and the program blocks BLOCK. In the 
hierarchy tree of the Minllan language, there are at most six levels. For exampte, the corresponding tree 
for the Pascal language branches into a maximum of 20 levels white the C-language structore tree 

20 fomis about 30 levels, due to the nested structure of the language. In the publication Turbo C Reference 
Guide, 1987, ISBN-0-87624-160-3, the syntax of the grammar of C is presented term by term. A 
stmctural presentation of the Microsoft Visual Basic 6.0 language in the form of Figure 5 contains about 
700 lines. 

25 As one progresses fypk:ally downwards (i.e. to the right) in the diagram, every second term is written in 
capital tetters and every other temn begins in small letters. The capital letters represent the production 
of a grammatical concept (dark background) l.e. a master term and the terms beginning with small 
letters represent occurrences, which b^ln with a semantic name. Cells beginning with "— >" are links 
to a new language, in this case C (light-grey background). At the beginning of the link, the name of the 

30 new language production is stated while after the vla-sign comes the semantic concept, which 
corresponds in the new language to the temn in the X language. The semantte concept (light-grey 
background) is fomied automatically in the translator developer, with the aid of interface operattons. 

As will l>e clear to one versed in the art, the method has numerous applk^tions in the various fiekJs of 
35 programming technique. The following presente applfcations. in all of which at least one input language 
grammar is used and at least one target language grammar, as well as crossings between them 
according to the method: 

1. A translator from one tenguage to another is devetoped, for example, Pascal-language source code 
is transteted into C-language code. The languages are crossed according to the method. 
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2. Protocol software is developed in embedded systems. A domain specific language (DSL) is 
developed, which is based, for example, on C, but which contains additional calls to the object areas 
library routines and central data types. The protocol grammar Is then crossed with this new domain 
specific language. The protocol language can be. for example, the language called CSN.1. which is 

5 used in GPRS and UMTS systems. Using the method, automatically ready-constructed codecs and 
decodecs of the protocol applications for the relevant data-transfer interfaces are obtained. 

3. Configuration applications for industrial products or software applications are created in such a way 
that the input language is a domain specific language (DSL) and the target language is a selected 
programming language. The tenms of the domain specific language indude product characteristics data 

10 direcbries, or file systems, which are defined as grammar. The customer requirements are entered with 
the aid of an interface, as a result of which input-language text Is created. It is converted with the aid of 
a crossing mechanism to fonm, for example, C-language product-configuration software. 

4. The programs are translated to a limited-vocabulary language. The input language is the 
programming language and the target language is the desired sub-group of a natural language with its 

15 clause order. 

5. The program code is transfenred to CASE-means software development use. The software can be 
defined in a high-level language (domain specific language) at the start of a project As the definition 
becomes more precise, a transfer is made to a traditional CASE-means taking into account the interface 
and data presentation fomnats of the CASE means, for example, the known UML presentatton format 

20 6. The data structure and object structure data are selected from the program code and are converted 
to CASE-means graphical symbols and internal data stnjctures. The part of the CASE-means forms the 
target language and the programming language the input language. 

7. Test data is devetoped automatically for the applicatton software using a separate test language, 
which is crossed with the applk^atton language. The test language can act. for example, as a customer 
25 simulator, whteh gives commands and terget data according to an Internet interface. 

The development of a translator between the COBOL and JAVA languages can be given as an example 
of the size of the files and the amount of wori^ required. Manual fomial descriptions of the grammars are 
600 - 1000 lines. The automatically made accessory files are 8 - 10-times this size. The conversion 
30 instruction file is created when using a developer, partteularty an interactive interface, within a few days. 

Example program 

In the example program, there are commands according to the grammar, an assignment clause, and 
35 a repeating loop, whteh contains an incrementatton command: addition and assignment of the result 



to a variable: 



1 = 0. 



; assignment to variable 
; loop, in which there is an assignment, within which there is 
a further additton operatton 



repeat I = 1+1. 



40 



until K11. 
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The program's parsing tree after the scanner and parser is as follows: 

[asslgnnrient(varfi"),numi3er(0))Joop{[assignment{vaK^^ 

number(11))] 

5 start of translation 

The translation starts directly from the parsing tree without an intermediate stage (processing of the 
symbol table and reservation of the variables may precede the translation). 

10 The generation of the input language parsing tree is started by exploitirig directly the 

conversion instruction made between the tenms. The generating code obtained from the translator 
generator is as follows: 

gen_PROGRAM(X, Str) 
15 get_PRCX3RAM_STATEMEaiT{X,Y), genjr_STATEMENT (Y, Str), I. 

2. At the starting stage, the variable X has the value: 

X = [assignment(varrn,number{0)) joop(Iassignment(var(T),add(value(va^^ 
var(T),number(11))] 

20 The variable Str returns the translation result as a character string. The vanab\e Y contains the target- 
language equivalent of the variable X after the conversion get_PROGRAM_STATEMENT. 

3. At the starting stage, the call stadc of the translator is as follows (the caU performed first is the 
lowest): 

25 1 genjprogramC [aasigmnent (var ("i") ,nt3inber(0) ), loop ( [assignment (var("i") , 

acW<value(var(''i")),n\iinber(l)))],var("i")^nuinber(ll))], _ ) 

2 activate_translator ( ) 

The parsing tree is interpreted as a ST ATEMENTUST structure of the target language (C). The call 
30 stack has the following appearance: 

1 get program statementlist ( [assignment {var ("i") , nuinber (0) ) , 

loop ( [assignment (var ( "i") , add (value (var ( "i") ) , nmnber (1) ) ) ] , 
var("i-),nmnber(ll))], _ ) 
35 2 get program statement ( [assignment (var ( "i" ) , number ( 0 ) ) , 

loop ( [assignment (var ( "i" ) , 
add(value(var("i'')),number(l)))],var{"i"),number(ll))], _ ) 

3 genjprogram ( [assignment (var ( "i " ) # number ( 0 ) ) , 

loop ( [assignment (var ("i" ) , add (value (var ( "i" ) ) , nuinber (1 ) ) ) ] , var { "i " ) , 
40 nuinber (11))], _ ) 

4 activate translator () 
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The list STATEMENTLIST is divided into blocks, which are begun to be translated, starting from the 
beginning of the list If there are blocks in the list, according to the Prolog deftnitk>n the procedure moves 
to the second line, in vihkh the first value of the list is obtained for the variable H1: 

get PROGRAM STATEMENTLIST ([]/[]):- ! . 
5 ge t_PROGRAM_STATBMENTLIST ( [HI | Tl ] , [H2 1 T2 ] ) : - 

get_COMMRND STATEMENT (HI, H2), 

get__PROGRAM_STATEMENTLIST(Tl, T2) , !. 

At this stage, the value for the variable H1 becomes: 
10 HI = assignment(varfi").number(0)) 
and the variable T1 is assigned the value 
T1 = [k)op([assignment(var(T),add(value(var("i")).number(1)))], 
var("i^,number(11))] 

The variables H2 and receive a value only at the terminating stage of the translatton 
15 (H2 is thus _ and T2 is also J. 

In the following stage, the first command, the assignment clause, in converted into the semantic of 
the target language, in whteh case the call stack is as follows: 

20 1 get cammand statement { assignment (var ( "l" } , number ( 0 ) ) , ) 

2 get program statementlist ( [assignment (var ("i" ) , number (0) ) , 
loop ( [assignment (varCi") ^ add (value (var ("i") number (1) ))], 
var(''i"),number(ll))], _ ) 

3 get program statement ( [assignment <var ( "i") , number (0) ) , loop ( [assignment 

25 (var(''i"),add{value(varCi")),number(l)))],var("i")/number(ll))l, _ ) 

4 gen_program( [assignment (var("i") , number (0) ) ,loop( [assignment (varCi") , 
add(value(var("i")), number (l)))),var("l''),nuiiiber(ll))], _ ) 

5 activate^translator ( ) 

30 The repeatHjntil ]oop Is translated by first dealing with the commands of the k)op and finally the 
control structures. The call stack inside the loop is as foltows: 

1 get program statementlist ( [loop ( [assignment (var { "i" ) , add (value 

(var ("i")), number (1)))], var ("i") .number (11) )) , _ ) 

35 2 get ^program statementlist ( [assignment (var ( "i" ) , number ( 0 ) ) , loop ( [assign 

ment (var ("i") , add (value (var ("i") }, number (1) ) ) ] ,var ("i") , number (11) )}, _ ) 

3 getjprogram ^statement ( [assignment (var ("i") , number (0) ) , loop ( [assignment 

(var("i"), add(value(var("i")),number(l)))],var(''i"),number(ll))], _ ) 

4 gen _j)rogram( [assignment (var ("i") , number (0) ), loop ( [assignment (var ("i") , 
40 add(value(var("i'')),number(l)))],var("i"),number(ll))l, _ ) 

5 activate_translator ( ) 

The conversion code corresponding to the program loop is as foibws: 

get_PROGRAM_STATEMENTLIST ( (HI I Tl ] , [H2 | T2 J ) : - 
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get_COMMaNp_^STATEMENT(Hl, H2) , 
get_PROGRAM_STATEMENTLIST (Tl, T2) , ! . 

The program operates in such a way that the left parameter acts as the input parameter in the list 
5 fbmnat. the start of which list being the temi H1 and the end of the list the tenn T1. After the conversion, 
the list of the farthest right-hand parameter is fonmed, the first value of which is Ft2 and the end of the list 
T2. Because the input list is a Command type and the target list is a Statement type, the conversion of 
the first pair of variables (HI -> H2) requires the call getjCOMI>MND_STATHVIENT, which returns the 
value H2. After the perfonnance of this conversion, the principal operation calls the parameter values T1 
10 and T2, which are further broken down recursively into the initial and final temis of the list, as defined in 
the Prolog language. 

Coming to the code before the instancing of the variables, the variables have no value 
(H1«^T1=_.H2 = _ja T2 = J. 

15 

The first values of the variable pair (H1 ja T1 ) after the assignment clause: 

HI = assignment (var("i") , number (0) ) 

Tl =• [loop([assigninent(var('»i"),add<value(var("i")),nuinber{l)))3, 
var ("l") ^number (IX) ) ] 

20 The variables H2 and T2 have Initially no value (their States are H2=:_, T2=:.accordlng to Prolog's 
definitibn). 

After the performance of the assignment clause, the variable H1 has the value: 
HI « loop([assigxment(var(''i''),add(value(var("i'')),nuinber(l)))], 
25 varfi"), number (11)) 

The program terminates with a loop command, so that Tl receives the value Q (empty group). 

Next, the translation of the loop is carried out using the conversion instruction 
30 geLCOMMAND_STATEMEMT: 
(Example of Rgure 4) 

get_COMMAND ^STATEMENT (loop (PR0GRAM1,VAR2, EXPRESSIONS) , 

is (do (STATEMENTlr generate (relatxve_oper (var (VARl) , It ( ) , 6ENEXPRESSIQN3) )))):■- 

35 

get PROGRAM STATEMENT (PROGRAMl, STATEMENTl) . 

get JEXPRESSION ^GENEXPRESSION (EXPRESSI0N3 , GENEXPRESS10N3 ) , 

get_VAR_VAR ( VAR2 , VARl ) , 

40 

Here the translator meets the "loop" command, in Figure 4 a call follows to the fomned clause, which 
contains a target-4anguage (descriptive language) term and the sub-temis of both sides. From this call, 
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the translation program forms calls to the clauses defined by the divisible sub-tenms and recursively 
always new calls, until indivisible clauses can be retumed for the variables. 

In this case, the values of the variables in the loop are as follows: 
5 PROGRAMl « [assignment (var("i"),add(value(var("i'')),nuniber(l)))] 
VftR2 » var("i") 
EXPRESSIONS = number (II) 

In the above command, the variables STATEMENT1, VAR1 Ja GENEXPRESSI0N3 have no value 
10 in the place shown by the arrow. 

The assignment clause inside the loop is as follows: 

get_coinmand statement ( assignment (var("i") , add (value (varCi") ) , 
numiberd) )) , _ ) 

15 

get_CC»MAND ^STATEMENT (assignment (YARl, EXPRESSI0N2) , 

expr (asse (generate (relative_oper (GENEXPRESSIQNlreqO ,GENEXPRESSI0N3) )))):- 
get_VAR_GENEXPRESSION(VARl,GENEXPRESSIC»il), 
get_EXPRESSION ^GENEXPRESSION (EXPRESSI0N2^GENEXPRESSI0N3) , 

20 !. 

The variables in the place shown by the anpow are: 

VARl = var("i") 

BXPRESSI0N2 = add (value (var ( "i" ) ) , number (1) ) 

25 

The variables in the place shown by the thick anrow are: 

YARl = var("i") 

EXPRESSI0N2 « add (value (var ("i") ) ^number (1) ) 
6ENEXPRESS1OT1 ^ var (var ("i", 0) ) 

30 

The variables after the line shown by the thick anrow are: 

VARl ^ var(-i") 

EXPRESSI0N2 - add (value (var ("i") ) ,nuraber(l) ) 
GENEXPRESSIom « var (var ("i", 0) ) 
35 GENEXPBESSXQN3 = math_oper (var (var ( "i" , 0 ) ) , plus , const ( i ( 1 ) ) ) 

The call stack in the assignment clause inside the loop is as follows: 

1 get^program statement ( [assignment (var { "i" ) , add (value (var ( " i" ) ) , 
40 ntanbGrd)))], _ ) 

2 get command statement { loop ( [assignment (var ( "i" ) , add (value (var ( "i" ) ) , 
number(l) ))], var ("i"), number (11)), _ ) 

3 get program statementlist ( [loop ( [assignment (var ( "i" ) , add (value (var ( "i" ) ) , 
nuinber(l)))l,var(-i"), number (11))], ) 



wo 02/093371 



PCT/n02/00411 



14 

4 getjrogram ^statementlist { [assigment (var( number (0) ) , 

loop( [assignment {var("i") r add (value {var("i") ) , number (1) ) ) ] , 
var("i") ,nuinber(ll))], _ ) 

5 get jprogram statement ( [assignment (var { "i" ) , number (0 ) ) , loop ( [assignment 
5 (var("i"), add (value (var ("i") ), number (1) ))], var ("i") ^number (11) )] , _ ) 

6 gen_program( [assignment (var ("i") r number (0) ) , 

loop ( [assignment (var ( "i" ) , add (value (var ( "i" ) ) , number ( 1) ) ) 3 , 
var(''i*'),nuinber(ll))], _ ) 

7 actlvate_translator ( ) 

10 

At the Start of the loop, the variable PROGRAM receives the value: 
PROGRAM = [assignment (var("i"),add(value(var(''i'')), number (1)))] 

The call stack at the end of the assigrtment clause Is: 

15 1 get_program ^statementlist ( [assignment (var ("i") , add (value (var ("i") ) , 

number (1) ))]/_) 

2 get program statement ( [assignment (var ( "i" ) , add (value (var ( "i" ) ) , 
nunber(l)))], _ ) 

3 get command statement ( loop ( [assignment (var ( "i" ) , add (value (var ( "i" ) ) , 
20 number(l)))],var("i"),number(ll)), _ ) 

4 



The variables at the end of the clause are: 

HI = assignment(var(''i"),add(value(var("i")),number(l))) 
25 Tl = [J 

H2 « expr (asse (generate (relative_oper (var (var ( "i" , 0 ) ) , 
eq,math_pper (var (var ("i", 0) ) ,plus, const (i (1) ) } ) ) ) ) 

T2 « 



30 Variable H2 thus contains the target-language equivalent to variable H i. 



The call stack in the assignment clause inside the k>op is: 

1 get program statement ( [assignment (var ( "i" ) , add (value (var { "i" ) ) , 

number (1) ))],_) 

35 2 get command statement ( loop ( [assignment (var ( "i " ) ^ add (value (var ( "1 " ) ) , 

numberd) ) ) ] , var ("i") , number (11) ) , _ ) 

3 get program statementlist ( [loop ( [assignment (var ( "i" ) # add (value (var ( "i" ) ) , 
numberd)))], var ("i") ^nuiriber (11) ) ] , _ ) 

4 

40 

The value of the loop's internal code is in the variable PROGRAM: 

PROGRAM =^ [assignment (var ("i") ,add(value(var("i") ), number (1) )) ) 
The corresponding target-language value In the variable STATEMENTLIST1 = 
[expr (asse (generate (relative_6per (var (var Cl^fO) ) ,eq,math_oper (var(var ("i", 0) ) 
46 ,plus,const(i(l) ))))))] 
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The call stack in the situation, when returning from the loop: 

1 get^command statement( loop( [assignment (varC"!") , add (value (var("i") ) , 

number (1) )) ], var ("i") ^number (11) ) , _ ) 

5 2 getjprogram stateiiientlist( (loop ( (assignment (var("i") , add (value (var("i") ) , 

number (l)))],var("i"), number (11)) 3, _ ) 
3 get program statement list ( [assignment (var ( "i") , ninnber (0) ) , 
loop ( [assignment (var("i") , add (value (var("i") ) , number (1) ) ) ] , 
var ("l"), number (11))], _ ) 
10 4 

The variables after the return from the loop are: 
PROGRAM1 = [assignment(varf r)*add(value(varCr))>number(1)))] 
VAR2 = varrr) 
15 EXPRESSIONS = mimber<11) 

STATEMENT1 = cs(stmntIist(Iexpr(asse(generate(relatiyejoper(var(v^ 
eq,math_oper(var(varCi".0)).pius.const(i(1)})))))])) 
VAR1=_ 

GENEXPRESSION3 = _ 

20 

In the loop command conversion stage the call stack is as follows: 



1 get_command ^statement ( loop ( [assignment (var ( "i " ) , add (value (var ( "i" ) ) , 

numberd)))], var ("i") , number (11) ) , _ ) 

2 getjprogram ^statementlist ( [loop ( (assignment (var ("i" ) , add (value (var ( "i" ) ) , 

25 numberd)))], var ("i") , number (11) )] , _ ) 

3 get program statement list ( [assignment (var ( "i " ) , number ( 0 ) ) , 



loop ( [assignment (var ("i") , add (value (var ("i" ) ) , nuinber (1) ) ) ] , 
var ("i"), number (11))), _ ) 
4 

30 

In this stage, the values of the variables are: 

PROGRAMl « [asslgimtent (var ("i**), add (value (var ( "i**) ), nuinber (1) )) ] 

vaR2 = varCi") 

EXPRESSIONS = number (11) STATEMENTl = 
35 cs(stmntlist ( [expr(a3se(generate(relative_oper(var(var(''i**,0) ) ,eq,math_oper (var (va 
r ("i" , 0) ) ,plus, const (i (1) ) ) ) ) ) ) 1 ) ) 
YARl « _ 

GENEXPRESSI0N3 « const (i (11)) 

40 The can Stack in the loop is: 

1 get^coramand statement ( loop ( (assignment (var ( "i" ) , add (value (var ( "i" ) ) , 

number(l)))], var ("i") , number (11) ) , _ ) 

2 get program statementlist ( [ loop ( (assignment (var ( "i" ) , add (value (var ( "i " ) ) , 
numberd)))], var ("1") , number (11) )] , _ ) 
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3 get program statementlist ( [assignment (var { "i" ) # nxxmber ( 0 ) ) , loop ( [assign 
mentCvarCi"), add (value (var ( "i") number (1) ))] ,var{"i"),nuinber (11) )] , _ ) 

4 

5 After the call, the variables are: 

PROGRAMl «= [assignment(var("i''),add(value(var("i")),nuinber(l)))] 
VAR2 « varCi") 
EXPRESSIONS *= nuinber(ll) 

STATEHENTl » cs (stmntlist ( [expr (asse (generate ( 
10 relative_oper (var(var ("i^^O) ) ,eq,math_oper{var (var ("i",0) ) , 
plus,const(i(l) ))))))])) 
VARl » var("i",0) 
GENKXPR$:SSI0N3 » const (1(11)) 

15 After the solution of the axielcx>p, the variable situation is: 

HI = loop([assignment(var("i''),add(value(var("i'')),nuna;)er(l)))], 

var Ci"), number (ID) 
Tl - [] 

H2 is (dp (cs(stmnt list ( [expr(asse (generate ( 
20 relative_oper (var (var ("i", 0) ) , eq^math^oper (var (var ( "i", 0) ) ,plus, 

const (i (1) ))))))])) r generate (relative_oper (var (var ( "i", 0) ) , It, 

constddl)))))) 
T2 = _ 

25 When solving the acldition clause, the call Stack is: 



1 get_expression ^genexpression ( add (value (var ("i") number (1) ) , _ ) 

2 get command statement ( assignment (var ( "i" ) , add (value (var ( "i" ) ) , 
number (1) ) ) # _ ) 

3 get program statementlist ( [assignment (var ("1") , add (value (var ("i") ) , 
30 number(l)))], _ ) 

4 getjprogram statement ( [assignment (var ("i") , add (value (var ("i") ) , 

number (1) ) ) ] , _ ) 

5 get command statement ( loop ( [assignment (var ( "i" ) , add (value (var ( "i" ) ) , 
number(l)))], var ("i") , number (11) ) , _ ) 

36 6 get jprogram statementlist ( [loop ( [assignment (var ("i") , add (value (var ("i") ) , 

number(l)))), var ("i") , number (11) )] , _ ) 

7 get jprogram_statementlist ( [assignment (var ("i") , number (0) ) , loop([as8ign 
ment(var("i"),add(value(var(''i")),number(l)))],var("i"), number (11) )] , _ ) 

8 get jprogram ^statement ( [assignment (var ( "i" ) , number (0 ) ) , loop ( [assignment 

40 (var("i"), add (value (var ("i") ) ,number(l) ))], var ("i") ^nuinber (11) )] , _ ) 

9 genjprogram( [assignment (var ("i") ,number(0) ) ,loop( [assignment (var ("i^) , 
add(value(var("i") ),number(l))) ], var ("i"), number (11))], _ ) 

10 activate_translator 0 



45 

In the 



loop's Internal assignnnent clause, the call steck is as follows: 
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1 get_conimand statement < assignment (var { "i" ) , add (value (var ( "i" ) ) , 

nuinber (1) ) ) / _ ) 

2 get program statementlist ( [assignment (var ( "i" ) ^ add (value (var ( "i" ) ) , 
number (1) 

5 3 get program statem en t ( I assignment (var ( "i" ) , add (value (var ( "i" ) ) , 
number (1)) ) ] r „ ) 
4 

After carrying out the addition, the variables are: 

10 vmi = var("i") 

EXPRESSI0N2 « add (value (var ("i-) ) .number (1) ) . 
GENEXPRESSIONl « var (var ("i-^O) ) 

GENKXPRESSIONS « math_oper (var (var ("i", 0) ) ,plus, const (i (1) ) ) 



When the loop terminates, the call stack is as follows: 

1 get program statementlist ( [], _ ) 

2 getjprogram ^statementlist ( [loop ( [assignment (var ("i") , add(value (var ('•i") - 

), number(l)))3r var ("i") , number (11) )] , _ ) 

20 3 getjprogram ^statementlist ( [assignment (var ("i") , number (0) ) , loop ( [assign 

ment(var("i''), add(value (var ("i") ) ,number(l) ))) ,var("i'') , number (11) )) , _ ) 
4 

When the program terminates, the call stack is as follows: 

25 1 getjprogram ^statementlist ( [assignment (var ("i") , number (0) ), loop ( [assign 

ment(var("l"), add(value(var("i") ) .number (1) ) ) l,var ("i") ,number(ll) ) J , _ } 

2 getjprogram statement ( [assignment (var ("i") , number (0) ) , loop ( [assignment 

(varCi"). add(value(var ("i") ) .number (1) ))] ,var("i") , number (11) )] , _ ) 

3 gen j>rogram( [assignment (var ("i") .number (0) ) .loop ( [assignment (var ("i") . 
30 add(value(var("i")).number(l)))].var{-i"). number (11))]. _ ) 

4 activate_translator ( ) 

The variable situation at the termination of the program: 

36 HI » assignment (var ("i") .nt2mber(0) ) 

Tl = [loop([assignment(var("i''),add(value(varCi")).number{l)))). 
var ("i") .number (11) ) ] 

H2 « expr (asse (generate (relative_oper (var (var ("i". 0) ) , eq, const (i (0) ) ) ) ) ) 
T2 = [is (do (cs(stmntlist([expr (asse (generate (relative_oper( 
40 var (var ( "i" . 0 ) ) , eq, math_oper (var (var ( "i" . 0 ) ) . plus , const (i (1 )))))))])) . 
generate (relative_oper (var (var ( "i" . 0) ) , It, const (i (11) ) ) ) ) ) ] 

1 getjprogram statement ( [assignment (var ("i") , number (0) ) .loop( [assignment 

45 (var("i"). add( value (var ("i")) , number (1) ))), var ("i") .number (11) )] . _ ) 
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2 gen^program ( [assigiment (var ("i") ^nimiber (0) ) ,loop( [assignment (var ("i") , 

adcKvalue (varCi") ) ,niiinber(l) ) ) ] ^var("i-) ^number (11) ) ], _) 

3 activate_translator 0 

5 Rnally, a taiget-language parsing tree oorresponding to the original program PROGRAM is obtained 
in the variabie STATEMEI4TUST1: 

PROGRAM ~ [assignment (var Ci") ^number (0) ) , loop ( (assignment (var ("i") , 

add (value ( var ( "i" ) ) # nxmiber (1) ) ) ] , var ( "i" ) , number (11) ) ] 
STATEMENTLISTl = [expr (asse (generate (relative_oper (var (var ( "l", 0) ) , eq, 
10 const (i(0) ))))), 

is (do (cs (stmntlist ( [expr (asse (generate (relative_oper (var (var ("1", 0) ) , 

eq,math_oper (var (var ("i", 0) ) , plus, const (1(1)))))))])), 

generate (relative_oper (var(var ("i^^.O) ), It, const (i (11) )))))] 

15 

Rnally, the procedure moves to the input-language program's code's clause genjprogram. in which only 
now the solution is initiated of the target-language syntax variable Str. At this stage, the call stack is: 
1 genjprogram( [assignment (var ("i") , number (0) ), loop ( (assignment (var ("i") , ' 

add(value(var("i")),number(l)))l,yar("i"),nuniber(ll))l, _ ) 
20 2 activate_translator () 

The variable situation prior to the definition of the target-language syntax is as follows: 

X * [assignment(var(''i''),nunber(0)),loop( [assignment (var ("i"), 
add (value (var ("i") ) , number (1) ) ) ] , var ("i") , number (11) ) ) 

25 Str = _ 

Y = cs (stmntlist ( [expr (asse (generate (relative_oper (var (var ( "i" , 0) ) , 

eq, const (i (0) ))))), is (do (cs (stmntlist ( [expr (asse (generate (relative_oper ( 
var (var Ci'SO) ) ^eq^math^oper (var(var ("i", 0) ) , plus, const (i (1) ))))))])) , 
generate (relative_oper (var (var ("i", 0) ) , It, const (i (11) ) ) ) ) ) 1 ) ) 

30 

Gen clauses 

In the following, code generation is started to define the syntax portion of the target language: 

35 1 gen_y_statement ( cs (stmntlist ( [expr (asse (generate (relative_oper( var ( 

var (-i"/0) ) , eq, const (i (0) ))))), is (do (cs (stmntlist ( (expr (asse ( 
generate (relative_oper (var (var("i",0) ) ,eq,mathjoper (var (var ("i", 0) ) , 
plus, const (id)))))))])), generate (relative_oper (var (var ( "i", 0) ) , It, 
const(i(ll) )))))])), _ ) ' ' 

40 2 gen_program( [assignment (var ("i") , number (0) ), loop ( [assignment (var ("i") , 

add(value(var("i"))»nuniber(l)))l,var("i")#number(ll))], _ ) 
3 activate__translator ( ) 

The variable at the start when moving to the compound statement 
45 OOMPOONDSTMNTl stmntlist ( [expr (asse (generate {relative_oper( var ( 
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var ("i", 0) ) , eq, const (i (0) ))))), is (do (cs (strontlist ( [expr (asse (generate ( 
relative_oper (var (var ("i", 0) ) , eq,inath_oper (var (var ( "i" , 0) ) , plus, 
const (i (1) ) ) ) ) ) ) J ) ) r generate {relatlve_oper (var (var ( "i", 0) ) , It , 
const (1(11) )))))]) 
6 Str « _ 

Strl « _SList « _ 

the following, the definition of the syntax of the C-language assignment clause i=0 Is started: 
^^'^-y^statement { expr (asse (generate (relative_oper (var (var ("i", 0) ) , 
eq,const(i(0)))))), _ ) 

gen_y_stateinentlist ( [expr (asse (generate (relative^oper (var (var ("l", 0) ) , 
const (i (0) ))))), is (do (cs (stmntlist ( [expr (asse (generate ( 
relative_oper(var(var(''i'',0) ) ,eq,]nath_oper (var (vaic ("i^fO) ) ,plus, 
const (i (1 )))))))])) , generate {relative_oper (var (var ( "i" , 0 ) ) , It, 
const (1(11)) ))))], _ ). 

^e'LX-Con^^dstinnt ( stmntlist ( (expr (asse (generate (relatlve_oper (var ( 
var ( "1", 0 ) ) , eq, const (1 (0) ))))), Is (do (cs (stmntlist ( [expr ( 
asse (generate (relative_oper (var (var ("i", 0) ) , eq,matli_oper (var ( 
var ("1", 0) ) ,plus, const (i (1) ))))))])) , generate (relative_oper ( 
var(var(-i",0)),lt,const{i(ll) )))))]), _ ) 



In the assignment clause, the variable EXPRESSION1 receives the value: 

EXPRESSION! =» asse (generate (relative_oper (var (var ("i",b) ) , eq, const (i (0) ) ) ) ) 

25 Its conversion into source language takes place using the clause genjoutput 

1 genjoutput ( ["i", "0"] , "genexpression_relative__oper", _ ) 

2 gen_y_genexpression( relative_oper (var (var ("i^^O) ) ,eq,const (i(0) ) ) , _ ) 

9enuy_assignment_expression( generate (relative_oper (var (var ("l", 0) ) ,eq, 
30 conat(i(0)))), _ ) 

3 gen_y_expresslon ( asse (generate (relative_oper (var (var ("i", 0) ) , 
eq,con3t(i(0))))), _ ) 

4 

3S The following value is taken to the generating clause genjoutput 

Slist = [••i","=»',"0"l 

At the same time, the fonmat clause Form is called using the following value: 
Form = ••genexpression_relative_oper*' 

40 Using the value Fonm the corresponding fonnat clause is retrieved from the accessory file CPRM. 
The call stack of ttie assignment clause (N)) is as fdkyws: 

1 genjoutput ( ["1 « 0"], "assignment_expression_generate", _ ) 

2 gen^_assignment_expresslon{ generate (relative_oper (var (var ("i", 0) ) ,eq, 
46 const(i(0)))), ) 
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3 gen_y_expression( asse (generate {relative_oper(var(var("i'',0) ) ,eq, 
const (i(0))))), _ ) 

4 

5 The variable situation in the fbnnation of the assignment clause is: 

Slist « ["i « 0"] 

Form = "assigninent_expression_generate" 
Str - _ 
StrF « _ 

In the Incrementation stage, the call stack is as follows: 

1 gen__output( ["i « i + 1"], "assigniaent_expression_generate", _ ) 

2 genjy_as3ignment_expres3ion(generate(relative_oper(var (varCi",0) ) , 
eq,inath_oper(var(varC±",d)),plus,con3t(l(l))))), _ } 

15 3 gen_y_expression ( asse (generate (relative_oper (var (var ("i", 0) ) , eq, 

iaath_oper(var(var(''i'',0)),plus,const(i(l)))))), _ ) 

4 gen _y_stateinent ( eacpr (asse (generate (relative_oper (var (var ( "1 " , 0 ) ) , eq, 
matli_oper (var(var("i'',0) ), plus, const(i(l) )))))) , _ ) 

5 gen_y_statementlist ( [expr (asse (generate (relative_oper (var (var ( "i", 0) ) , eq, 
20 math_oper(var(var("i",0)),plus,const(i(l) ))))))], _ ) 

6 gen_y_CQinpoundstiiint ( stmntlist ( ( expr (asse (generate (relative_oper (var ( 
var ("i", 0) ) ,eq,inath_oper (var (var ("i", 0) ) , plus, const (i (1) ))))))]) , _ ) 

7 gen^_statexaent ( cs (stmntlist ( lexpr (asse (generate (relative_oper (var ( 
var("i",0)),eq,inath_oper(var(varCi",0)),plus,const(i(l)))))))))), _ ) 

25 8 gen_y_iteration_^st ( do ( cs (stnmtlist ( [expr (asse (genera te(relative_oper (var ( 

var ( " i " , 0 ) ) , eq, math^oper ( var ( va r ( " i " , 0 ) ) , plus , const ( 
id)))))))]))/ generate (relative_oper (var (var ( "i" ^ 0) ) ^ It, 
const(i(ll))>)), _ ) 

9 gen_y_stateinent ( is (do (cs (stmntlist ( [expr (asse (generate (relative_dper ( 
30 var (var ("i", 0) ) , eq,math_oper (var (var ("i", 0) ) ,plus, const (i (1) ))))))])) , 

generate(relative_oper(var(var("i",0)),lt,con3t(i(ll)))))), _ ) 

10 gen_y_statementllst ( [is (do (cs (stmntlist ( [expr (asse (generate ( 
relative^oper (var (var ( "i" , 0) ) , eq,math__oper (var (var ( "i" , 0) ) , 
plus, const (id)))))))])), generate (relative_oper (var (var ( "i", 0) ) , 

35 lt,const(i(ll) )))))], 

11 gen_y_statementlist ( [e3q>r (asse (generate (relative_oper (var (var ("i'',0) ) , 
eq, const (i (0) ))))), is (do (cs (stmntlist ( [expr (asse (generate ( 

rela tive__oper (var (var ( " i " , 0 ) ) , eq, math_oper (var (var ( " i " , 0 ) ) , 
plus , const (i (1) ))))))])) , generate (relative_oper (var (var ( "1 0) ) , 

40 lt,const(i(ll) )))))], _ ) 

gen_y_compoundstmnt ( stmntlist ( [expr ( asse (generate ( relative_oper (var (var ( 
"i", 0) ) , eq, const (i (0) ))))), is (do (cs (stmntlist { [expr (asse ( 
generate (relative_oper (var (var ("i", 0) ) , eq,math_oper (var ( 
var ("i", 0) ) ,plus,const (i (1) ))))))]) ), generate (relative_oper( 

45 var(var("i",0)),lt,const(i(ll))))))l), _ ) 

12 gen_y_statement ( cs (stmntlist ( [expr (asse (generate ( 
relative_aper (var (var ( "i" , 0) ) , eq, const (i (0) ))))), is (do (cs (stmntlist ( 
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[expr (asse (generate (relative_oper (var (var ("i", 0) ) , eq, 
iiiath_oper (var (var ("i", 0) ) ,plus, const (i (1) ))))))])) , 



generate(relative_oper{var(var("i",0)),ltrconst(i(ll))))))])), _ ) 
13 genjprogramC [assignment (var ("i") ,nvanber(0) ), loop ( [assignment (var ("i") , 

5 add(value(var("x")),nuiniber{l)))3,var("i"),nuinber(ll))3, _ ) 

15 activate_translator ( ) 



In the incrementation stage, the fonnatting clause "assignment_expressionjgenerate" is used: 
Slist « i-i = i + 1"] 
10 Form » "assignmentjexpression^generate" 

When fbnning the call stack, the final assignment clause is as follows: 



1 gen_y_expression ( asse (generate (relative_oper (var (var ("i", 0) ) , 

eq,math_oper(var( var ("i", 0) ), plus, const (i (1) ))))) , _ ) 
15 2 gen_y_statement ( expr (asse (generate (relative_oper (var (var ("i", 0) ) ,eq, 

math_oper (var(var("i",0)), plus, const (i(l) )))))), _ ) 

3 gen_y_statementlist ( [expr(asse (generate (relative_oper( var (var ("i**, 0) ) ,eq, 
math^oper (var (var ("i»,0)), plus, const (i(l)) )))))], _ ) 

4 gen_y_coinpoundstinnt ( stmntlist( [expr (asse (generate (relative oper(var( 
20 var("i",0)),eq,math_oper(var(var(''i",0)),plus,const(i(l)))))))]), _ ) 

5 gen_y_statement ( cs (stmntllst ( [expr (asse (generate <relatlve_oper (var ( 



var ("i", 0) ) , eq,math_oper (var (var ("i", 0) ) , plus, const (i (1) ))))))])) , _ ) 
6 

25 After the fbmiatting of the assignment clause, the variables are: 

ASSIGNMENT_EXPRESSI0N1 = generate (relative_oper (var (var ("i",©) > ,eq, 

math_oper (var (var ( "i", 0) ) ,plus, const (i (1) ) ) ) } 
Str « "i = i + l" 
Strl « "i * i + 1" 

30 SList = ["i = i + 1"] 

In the following, the fonnation of a block (conresponding to the content of the input-language k)op) 
above the assignment clause Is commenced: 

1 gen_output( ["i i = i + 1 ; )"] , "statement_cs", ) 
35 2 gen _y_stateinent ( cs (stmntlist( [expr (asse (generate (relative_oper( 
var (var ( "i" , 0) eq,math_oper (var (var ("i", 0) ) , plus, 
const(i(l)))))))])), _ ) 

3 gen_y_iteration_st ( do (cs (stxnntlist ( [expr (asse (generate (relative_oper ( 
var (var ("i", 0) ) ,eq,math_oper (var (var ("i", 0) ) ,plus, 

40 const (1(1)))))))])), generate ( relative_oper (var (var ( "i " , 0) ) , 

lt,const(i(ll))))), 

4 gen_y_statement ( is (do (cs (stmntlist ( [expr (asse (generate (relative_oper ( 
var (var ( "i " , 0) ) , eg, math^oper (var (var ( "i", 0) ) , plus, 

const (i (1) ))))))])) , generate ( relative joper (var (var ("i", 0) ) , 
45 It, const (i(ll) ))))), _ ) 

5 



wo 02/093371 



PCT/FI02/00411 



22 

The values of the variables produced by the code contained in the loop are: 

Slist = [-{ i = i + 1 ; }"] 
Form - "statement_cs'' 

5 

Next, the code of the control structure of the loop is fbnned: 

1 gen_output( ["{ i = i + 1 ; -± < 11"], -iteration_st_do" , _ ) 

2 gen_y_iteration_st ( do (cs (atmntlist ( [expr (asse (generate (relative_oper (var ( 
var ("i-, 0) ) ,eq,math_oper (var (var ("i", 0) ) ,plus, const (i (1) ))))))])) , 

10 generate(relativejoper(var(varCi",0)),lt,const(i(ll))))), _ ) 

3 gen _jr_statement ( is (do (cs (stmntlist ( [expr (asse (generate (relative_oper( var ( 
var ("i", 0) ) ,eqrmath_oper (var (var ("i", 0) ) ,plus, const (i (1) ))))))])) , 
generate(relative_oper(var(var(-i",0)),lt,const(i(ll)))))), _ ) 

4 gen_y_stateiinentlist ( [is (do ( cs (stnmtllst ( [expr (asse (generate ( 
15 relative__oper (var (var ("i", 0) ) , eqrmath_oper (var (var ( "i", 0) ) ,plu3, 

const (i(l) ))))))])) ,generate(relative_oper (var (var Ci^^O) ) , 
lt,const(i(ll) )))))], _ ) 

5 

20 At the beginning of the fbnmatting of the loop, the variables are: 

Slist = ["{ i « i + 1 ; }-,-i < 11"] 
Form « "iteration^st^do" 

At this Stage, the call staclcls: 

25 1 gen_output( ["do { i «• i + 1 ; } while ( i < 11 )"], 

"statement_is", _ ) 

2 gen_y_statement ( Is (do (cs (stnmtlist ( [expr (asse (generate (relative_oper ( 
var (var ("i", 0) ) , eq,inath_oper (var (var ("i", 0) ) ,plus, 

const (i (1) ) ) ) ) ) ) J ) ) , generate (relative_oper (var (var ("i", 0) ) , 
30 It, const (i (11) ))))), _ ) 

3 gen_y_statementlist ( [is (do (cs (stmntlist ( [expr (asse (generate (relative_oper 

(var (var ( "i" , 0) ) , eq,math_oper (var (var ( "i- , 0) ) , plus, const (id)))))))])), 
generate(relative_oper(var(var(''i'',0)),lt,const(i(ll))))))], _ ) 

36 4 gen_y_statementlist ( [expr (asse (generate (relative_oper(var (var ("i",0) ) ,eq, 

const (i(0) ) ) ) ) ) /is (do (cs (stmntlist ( [expr (asse (generate ( 
relative_oper (var (var ("i", 0) ) , eq,math_oper (var (var ("i'*, 0) ) , 
plus, const (id)))))))])), generate (relative_oper (var (var ( "i", 0) ) , 
lt,const(idl) )))))], __ ) 

40 5 gen j/_compoundstmnt ( stmntlist ( [expr (asse (generate (relative_oper (var 
(var ( "i", 0) ) , eq, const (i (0) ))))), is (do (cs (stmntlist ( [expr (asse (generate ( 
relative_oper (var (var ("i", 0) ) , eq,math_oper (var (var ("i", 0) ) , 
plus, const (i (1) ) ) ) ) ) ) 1 ) ) , generate (relative_oper (var (var ( "i", 0) ) , 
lt,const(i(ll) )))))]), _ ) 

45 6 
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The formatting of the loop command begins from the variable situation: 
Slist = fdo { r = i + 1 ;) while ( l< 1 1 )T 
Form = "statementjs" 

5 The fonnatting of the entire program starts from the following call stack: 

1 gen_output( I"i « 0 ; do { i = i + 1 ; ) while ( i < 11 ) 
"compoundstinnt_stiiintlist", _ ) 

2 gen_y compoundstinnt ( stmntlist ( [expr (asse (generate (relative_oper (var ( 
var ("i", 0) ) , eq, const (i (0) ))))), is (do (cs (stmntlist ( [expr ( 

10 asse (generate ( relative_oper (var (var ( "i" , 0 ) ) ^ eq, mathjoper ( 

var (var ( "i", 0) ) , plus, const (i (1) ) ) ) ) ) ) 1 ) ) , generate ( 
relative_oper(var(var(''i'VO)),lt,const(i(ll)))))))), _ ) 

3 genj^statement ( cs (stmntlist ( [expr (asse (generate (relative^oper (var ( 
var ( "i" , 0) ) , eq, const (1(0)))*))), is (do (cs (stmntlist ( [expr ( 

15 asse (generate ( relative_oper (var (var ( "i " , 0 ) ) , eq, math_oper ( 

var (var Ci", 0) ) ,plus, const (i (1) ))))))] ) ), generate ( 
relati ve_oper (var ( var ( "i", 0)), It, const (i (11) )))))])), _ ) 

4 gen_program< [assignment (var ("i") , number <0) ) , loop ( [assignment (var ("1") , 
add(valvie(var(''i")),number(l)))l,var("i"),number(ll))l, _ ) 

20 5 activate^translator ( ) 

The variables are then: 

Slist - ["i » 0 ; do { i » i + 1 ; } while ( i < 11 ) 
Form = "cQmpoundstmnt^stmntlist" 

25 

A group of C clauses (Compound Statement) as generated as source-language C-code using the 
clause: 

gen_Y_COMPOUNDSTMNT ( stmntlist ( STATEMEOTLISTl ) , Str ) : - 
gen_y_STATEMENTLIST (STATEMEOTLISTl, Strl) , 
30 SList «[Strl], 

gen_output(SList, "cQii^)oundstmnt_stmntlist'*, Str), !. 

The call stack is then as follows: 

1 gen_y_corapoundstmnt ( stmntlist ( [expr (asse (generate (relative_oper (var ( 
35 var ("i", 0) ) , eq, const (i (0) ))))), is (do (cs (stmntlist ( [expr ( 

asse (generate (relative_oper (var (var ("i", 0) ) , eq, mathjoper ( 
var (var ("i" , 0) ) , plus, const (i (1) )))))) 1) ) , generate ( 
relative_oper(var(var(-i",0)),lt,const(i(ll))))))l), _ ) 

2 gen_y_statement ( cs (stmntlist ( [expr (asse (generate (relative__6per (var { 
40 var ( "i", 0) ) , eq, const (i (0) ))))), is (do (cs ( stmntlist ( [expr ( 

asse (generate (relative_oper (var (var { "i" , 0) ) , eq,math_oper (var ( 
var ("1", 0) ) ,plus, const (1(1)))))))])), generate (relative_oper ( 
var(var(-i",0)),lt,const(i(ll))))))])), _ ) 

3 gen jprogram( [assignment (var ("i") , number (0) ), loop ( [assignment (var ("i") , 
45 add(value(var("l")),number(l)))],var("i"), number (11))], _ ) 

4 activate_translator ( ) 
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The variable situation is finally: 

STATEMENTLISTl = [expr (asse (generate (relative_oper (var (var ("i", 0) ) ,eqr 
const (i (0) ))))), is (do (cs (stmntlist ( [expr (asse (generate ( 
5 relative_oper(var (varCi^^O) ) ,eq,inath_oper (varCvarCi^^O) ) rPlus, 
const (i (1 )))))))])} , generate (relative_oper (var (var ("i", 0) ) , It , 
const (i(ll) )))))] 
Str » "{ i = 0 ; do { i = i + 1 ; } while ( i < 11 ) ; 
Strl « "i « 0 ; do { i » i + 1 ; } while ( i < 11 ) ; " 
10 SList « ["i = 0 ; do { i = i + 1 ; } while ( i < 11 ) ?"] 

In the final stage, the result of the translator is printed out to the screen using the clause 
dialogLSetStr, in such a way that the third parameter has the printable value: 

15 1 dialog_setstr ( Win, ID, "{i =0 ; do (i -i + 1 ;) while ( i < 11);}") 
2 activate_translator() 

The translation is now complete! 

20 

MATRIX PRINCIPLE 

Every formal language can be depicted as vectors X. X = {X,..X,} and each term can be depicted as 
elements of the vector. In this case, the underilning depicts the fonri of the vector. Thus the relationship 

^ between two languages and the possible conversions with their tenns can be depicted as a relationship 
b^ween the vectors X (input language) and 7 (target language), which is in practice a matrix. A sensible 
result is obtained from a translation between the input and target languages, only if at least one 
semanfic solutions. i.e. conrespondence, can be found in the target language for each tenrn of the input 
language, so that the tenn conesponding to the solution can be applied in the translator and a target- 

30 language portion corresponding to it can be printed out later. 

The invention is depicted as a conversion between two languages using the matrix formula 
Y = [A] * X. in which [A] is a matrix, describing the relation between a vector X an a vector?. In practice, 
the relationship comprises conversion instructions. In one cell Ay of the matrix, there is thus a 
35 conversion instruction between an element i of language X and an element] of language Y, if the 
conversion is possible. 

The selection of the matix according to ttie input language takes place on the basis of the semantic 
name (X,) of ttie numbered terms, virtien the index i is defined while selection on ttie basis of the target 
40 language takes place according to the operating connection, when tfie Index j is defined. The operating 
connection is based on the fonn of a corresponding occurrence, which is defined when creating a 
conresponding higher converston instruction. 
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For example, when an equivalent to the language X assignntent clause assignment(VAR, 
EXPRESSION) is created In a new language Y, in which there is a corresponding term bind{VAR, 
EXPRESSION), It is natural to make a connection from the EXPRESSION master temis to the target 
language's EXPRESSION main term t)etween all their possible occunences. It is preferable to use the 
5 smallest possible number of links between the languages and to select the most natural way to connect 
the teniis to each other. If the link fifom language to language is defined on some level of the grammar, 
then In all other links from an Input-language term to the new language it is possible to refer to the 
aforesaid link directly, if the new Rnk is on a higher hierarchy level. 

10 Example: 

Assume that the Input language's EXPRESSION contains three tenns and links are needed from it to 
the master tenns EXPRESSION, in which there are five tenns and ASSIGNMENT^EXPRESSION. in 
which there are three tenns. If both cases are handled comprehensively, at least six links (3 to both) will 
be needed in the conversion instruction. If ASSIGNMENT.EXPRESSION is a sub-group of the 
15 EXPRESSION tenn, and the link EXPRESSION->ASSIGNMENT.EXPRESSION has already been 
defined, it is possible to refer to the conversion instruction 
get_EXPRESSION_ASSIGNEMENT_EXPRESSION In the link EXPRESSION->EXPRESSION. Thus, 
only three links are needed to the direct terms and additionally a transfer from the master term 
EXPRESSION to the master tenn EXPRESSION. 

20 

On the other hand, when a link is created, for example, from the master term EXPRESSION to the 
master term MATHEMATICAL^EXPRESSION it virill only be necessary to define the mathematical 
clauses of the input language, in relation to the new temn. 

25 Using the aforemenBoned prindple. it Is also possible to implement rnuttHanguage translators, in whfch 
one metric Is the programming language, matrix [AJ and a second matrix [BJ is, for example, a data- 
transfer protocol or an operating system interface or library. 

Thus source code according to the Y language is produced, whrch is the result of the two matrices: 
Y = IA]*IB1*X. 

30 If, in the rmv interface matrix [B] a cell has a sdected value, for example empty, it means that the result 
of the [A] matrix is used as such in the new source code. If the value of the cell is something else, for 
example, a type-converston command, or a text format clause, the corresponding new version is 
exploited. 

35 DIVISION OF THE TRANSLATION INTO SEVERAL STAGES: 

The translation can be divWed into an infinite number of separate stages, which all exptoit the original 
parsing tree as starting date, but which also receive supplementary data from the preceding stages. For 
example, the data of the symbol table of the previous stage are used in the following stage, when the 
object classes are defined and the final code is printed out in the third stage. 

40 7 = IA]*[B1*[C]*X. 
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CREATING SYMBOL TABLES: 

The method is used to create symbol tables, using the X language's parsing tree as starting material as 
follows. The desired tenms of the language are defined as symbol variables in the dialogue or directly in 
the grammar file. A symbol is created using the name of the master tam reserved in the symbol table, 
5 for example, VAR signifies the name of the variable and STRUCT the name of the recorci, I.e. the 
structure. The definition is made from the grammar as follows: SYMBOLTABLE = VAR -> 
variable(VAR), STRUCT -> struct(STRUCT). 

Thus when the program goes through the parsing tree in the first stage of the conversion, aH the variable 
10 references are stored In the cache memory in every case where reference is made to VAR or STRUCT 
type temns. When moving to the generation of a new language, the necessary variable definitions (VAR 
and STRUCT) are printed out at the relevant point at the start of the method or function In question, or, 
for example, at the start of the entire program file. 

15 In ttie following are the grammatical descriptions and files relating to the example, as well as PDL 
algorittims for creating files automatically. In the PDL description, tiie words master term conresppnds 
to the word Production, the word tenm connesponds to the word Term, Uie word occumence to the word 
Subterm. and ttie word indivisible temn to the word Terminal , 

20 1) Inout-lanquaae X Grammar as a formal presentation 2) Descriptive lanauaoe (Prolog^ 



conversion 

The grammar is presented in ttie form Tenm name Occurences 
Occurrence = Syntax fbnn *->" Senianfic.correspondence 



25 



PROGRAM 



COMMAND* separator dot 



COMMAND 



VAR eq EXPRESSION -> assignment(VAR.EXPRESSION), 

repeat PROGRAM until VAR It EXPRESSION -> loop(PROGRAM,VAR.EXPRESSION) 



30 



EXPRESSION = 



EXPRESSION plus EXPRESSION 



-> add(EXPRESSION, EXPRESSION) 



VAR 



-> value(VAR), 

-> numbeF(INTE6ER) 



number(INTEGER) 



36 



VAR 



= name(STRING) 



•^var(STRING) 



3^ X lanouaae glossary Preserved words^ 
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In the ^mple, the reserved words are collected from the grammar, for instance: repeat, umit plus, h, 
dot AH the others except repeat and untU are interal abbreviations. TTie words are stored In a file in the 
fomri strjokCrepeat^repeatl in which the right-hand repeat refers to the semantic portion and the left- 
hand "repeat'* to the syntax portion. 
5 The following is the PDL (programming design language) algorithm. 

* 1 FOR EACH WORD IN GRAMMAR DEFINrnON 

n IFWORDBARESERVEDWORDOO 
*3 STORE rr INTO GLOSSARY FILE 
10 *4 AND MAKE A STRING str_tokCX";X). 

stiL,tokr.",dot) 
str^tokr=\eq) 
sfiLtokCiepeat",rqpeat) 
15 8tr_tokCimtil'',uiitil) 

strjldc("+",plus) 

4) DatatVPeg nf tfiR y lawyniapp. 

The datatypes are collected from the right-hand side of each tenm; assignment, hop, add, van number, 
20 value . 

The following is the PDL (programming design language) algorithm for collecting the datatypes. 
*FOR EACH TERM AND SUBTERM 

* GET TEffi RIGHT HAND SroE(RHS) OF EACH GRAMMAR 

* STORE THE RHS INTO GRAMMARS SEMANTICS AND INTO DATABASE 

CO^/l^4AND* 

asslgnment(VAR,EXPRESSION); 
k)op(PROGRAM.VAR.EXPRESSION) 
add(EXPRESSION.EXPRESSION); 
valiie(VAR); 
number(INTEGER) 
var{STRING) 

35 5) Scanner term fenfire glossary) 



PROGRAM 
COMMAND 

EXPRESSION = 

30 

VAR 
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The term p_tok is a colleclion of all possible language symbols. In the source code, it is infonnation for 
the scanner. The scanner reads Ihe input file and classifies each word in the file. 
The fbllowing is the conresponding PDL algorithm. 

*1 FOR EACH TERMINAL IN GRAMMAR DEFINITION 
5 *2 STORE THE TERMINAL INTO P_TOK-DOMAIN 

P.TOK =dot(): 

eqO: 
repeatO; 
untilO; 
10 ItO; 

plusO; 

number(INTEGER); 

name(STRING); 

nill 

15 

6) X lanQuaae parsing kxaic 

The parsing code (in the Prolog language) is developed automatically using ttie top-down technique. 
The fbllowing is the conresponding PDL - algorithm. 

20 n USE RECmSIVE DESCENT -METHOD (DIFFERENCE m 

*2 FOR EACH PRODUCTION 

♦3 GENERATE A CODE FOR EACH PRODUCTION 

*4 OF FORMAT "s^+ PRODUCTION NAME" Wrram AND LLO AS PARAMETERS 
*5 AND FOUND PRODUCTION TERMS AS AN OUTPUT PARAMETER. 
25 *6 CALL PRODUCnON'sSUBTERMS IN PROPER ORDER 

*7 RENUMBER UST TERMS LLn STARTING FROM! WITH LLO as 6 
♦8 FIND TOKENS OF SYNTAX BY A SUrrABIJBCAIX(pe3q)W 

*9 COLLECT ALL INFORMATION INTO PARSE TREE WTIH STARTSYMBOL AS A ROOT 
*10 GENrntATE SUB-PREDICATES FOR EACH SEPARATE PARAMETER COMBINATION 
30 *11 OF CURRENT PRODUCTION. 

Highest levet: ptxxiram and command tenns 



s_program(LL1,LljO,[COMMAND|PROGRAM]):- 
35 s_oomn)and(LL1.LL2,COMMAND),l. 

8_j)rogram1(LL2.U.0.PRO6RAM). 
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s_program(LL.LL,[]). 

s_program1(It(dotJ|LL1l.LL2,PROGRAM):-l, 

s_progtam(LL1 ,LL2.PR06RAM). 
sj)rogram1(LL,LL,D). 
5 8_oommand(LL1,LL0,assignment(VAR,EXPRESSION)}:- 
s_var(LL1,LL2,VAR), 
pexpect(t(eqJ.LL2.LL3). 
s_expression(L1.3,LL0,EXPRESSiON),l. 
sjcomn»nd([t(repeat,J|LL1],l±0,loop(PROGRAM.VAR,EXPRESSION)):-l. 
10 s_program(LL1,LL2,PROGRAM), 
pexpect(t(untiU,LL2,LL3). 
S_var(LL3,LL4.VAR). 
pexpect(t(lt,J.LL4.LL5), 
s_expression(LL5,LLO,EXPRESSION). 
15 8jcommand(LL^J:-psyntax_error(comnriand,LL).fail. 

7^ X lanaauae Dareina logic. 

Lower level. EXPRESSION and VAR terms 

20 8_»q)rBSslon{LL1,LL0,EXPRESSION):- 

s_expression1(LL1,U.0,EXPRESSION). 
s_expression1(LL1.LL0,EXPRESSJONJ:- 

s_expresslon2(LL1.LL2,EXPRESSION), 

s_expresslon3(LL2,LLjO.EXPRESSION,EXPRE8SIONJ. \ 
25 s_expression2(LL1,LL0,value(VAR)):- 
s_var(LL1,LL0,VAR),l. 
s_expressfon2([t(number(INTEGER),J|LL],LL.number(INTEGER)):-l, 
s_expression2(LL,_,J:-psyntax_error(expre8sion2,LL),fail. 
s_var(P(name(STRING)^|LL].LL.var(STRING)):-!. 
30 s_var(LL^_):-psynlaxjerror(var.LL),1iail. 

sj^resslon3([t(plusJ|LL1].LL0,EXPRESSION.EXPRESSIONJ:-l, 
s_expres8ion2(LL1.LL2.EXPRESSION1). 

s_expression3(LL2.LL0,add(EXPRESSION.EXPRESSION1}.EXPRESSIONJ. 
s_^ression3(LL.LL,EXPRESSION.EXPRESSION). 



8^ X language aeneratinq code 

Generating is ttie opposite operation to parsing. The generating code (in the Prolog language) is 
40 constructed from the master temi records of the database, in such a way that each temi is divided into 
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its occurrences and the final code string (source code to be printed out) is the sum of the sut>-tenns, 
which is formatted with the aid of fonnat clauses. 

The following is the corresponding PDL algorithm, 
n FOR E4CH PRODUCTION 

5 *2 IF PRODUCnON IS UST GENERATE A UST GENERATING COK^^ 

*3 ELSE GENERATE A TERM GHSNERATING COMMAND 
♦ 

*LIST GENERATING COMMAND: 

* 1 (ffiWBRATE LIST END EVENT: EMPTY IJST->EMPrySTl^ 

10 * 2 GENERATE LAST EVENT CASE: GENERATE A CALL FOR CURRENT USTS MEMBER 

* 3 OTHERWISE: GENERATE A CALL FOR CURRENT USTS MEMBER + 

♦ 4 GENERATE A lUECURSIVB LOOP FOR OTHER MEMBERS OF THE UOT 

The program, which is a consecutive list of commands, is collected into a string Str. 

genJPROGRAMiOn- 

15 gen_PROGRAM([Hl, Str):- genJ}OMMAND(H, Str), I. 
gen_PROGRAM([Hrn, Str):- 
gen_COMI\4AND(H, SI1), gen_PROGRAM(T, St2), fom)at{Str,"%s. St1.St2) . 

TERM GENERATING COMMAND: 

20 *1 FOR EACH PRODUCTION 

•2 FOR EACH SUBTERM 

•3 GENERATE A CLAUSE IN FORMAT 

*4 "gen_PR0DUCT10N{SUBTERM, Str)>" AS A HEAD AND 

*5 NUMBERED CALLS OF EACH SUBTERM PARAMETERS AND 

25 *6 AND PARAMETER OUTPUT STRINGS CALLED TOGETHER BY 

*7 AND SLIST AND gen.output-CALLS. 

♦ 8 WHERE Sir IS GENERATED SOURCE CODE FOR CURRENT LANGUAGE 

The assignment command contains a variable pn this case, its printout forni is Silrf) and a clause (in this 
30 case, its printout fomi is Sfr^ and they are added to the fonnatted string called "commandjassignment' 
using the list Slist The clause genjouput is retrieved into the fonnat string from the autojbm record, 
using the same argument 



gen_PROGRAM(0n. 
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gen_PR06RAM([H]. Str):- gen_COMMAND(H, Str). I. 
gen_PROGRAM([HlT]. Str):- 

gen_COMMAND(H. Sti). gen_PROGRAM(T, St2). fDrmat(Str."% . %". St1,SI2). 
gen_COMMAND(assignment(VAR1,EXPRESSION2) ,Str):- 
5 gen_VAR(VAR1,Str1). 

gen_EXPRESSION(EXPRESSION2,Sti2), 
SUst=[Str1^Str2]. 

gen_oiJtput(SList, "oommandjassignmenf , Str), I. 

10 gen_COMMAND(loop(PROGRAM1.VAR2.EXPRESSION3) ,Str):- 

g6n_PR0GRAM(PR0GRAM1.Str1), 

gen_VAR(VAR2.Sti2), 

gen_EXPRESSION(EXPRESSION3,Sb3), 

SIJ8t=IStr1,Sti2.Str3], 
IS genj9utput(SList, "command_loop", Str), I. 

gen_EXPRESSION(add(EXPRESSION1,EXPRESSION2) .Str):- 
gen_EXPRESSION(EXPRESSION1,Str1). 
gen_EXPRESS«ON(EXPRESSION2,Str2). 
20 SList=tStr1.Sti2]. 

gen_outpiJt(SList, "»q;>ression_add", Str), I. 

gen_EXPRESSION(value(VAR1) .Str):- 
gen_VAR(VAR1,Str1), 
26 SUst=tStr1J, 

gen_output(SIJst, "o(pres8ion_value", Str), I. 

gen_EXPRESSION(numl)er(INTEGER1) .Str):- 
strJnt(Str1.INTEGER1), 
30 SIJst=[Str1]. 

gen_output(SIJst, "aq9ression_numt)ef", Stt), I. 

gen_VAR(var(STRING1) .Str):- 
gen_STRING(STRING1 ,Str1), 
35 SUst=[Str1], 

gen_output(SList, "varjwir", Str), I. 

Genjoutput carried out ttie final finishing of the dause, using the format clauses. 



40 9) X-tenauaae fonmat daiises 
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The following is the corresponding PDL algorithm. 

♦1 FOR BAGH PRODUCTION AND SUBTERM NAME 
•2 GENERATB A FORMAT (XAUSE "auto Joon" FROM CURRE^ 

5 *3 AS A VARIABIJB PLACE AND OPERATOR TOKEN AS rrS PLACE 

♦4 NAME THDB CLAUSE BY A PRODUCTION AND SUBTBRMNAME AS PARAMETERS 

The translator developer goes through all the tenns in the database and constructs a data record of 
each one. The record has the torn auto_fo3rm(Id-stiing, fonnat-pait). The symbol signifying location, 
10 is assigned during the generating stage of the variables code. The fomiat clauses are constructed in the 
opposite sequence from grammar and abbreviations. Thus, the symbols ft eg, daiare Tisted. The strings 
are named in such a way that th^ have two parts: the master tenn and an underiine V and the name 
of a sub-term (for example, command assignment is "cQinmand_a8signmeiit"). 

15 auto Jbmrif command jasslgnment"."% = %"). 

auto_fonmrcommand_loop"»"repeat % until % < %"). 

autoJbrmfexpression_add"."% + %"). 

autoJbnnrexpression_value","%'*). 

auto.formrexpresslon.numbei",*^"). 
20 autDjbrm(*var_var,"%"). 

10) Conversion table, le, conversion instructions crea ted as the result of intBradiVB connecBon 

The get clauses are ready constructed in the Interface of the translator developer {Figures 3 and 4), 
25 which use the following algorithm. PROLOG-language clauses are obtained as the end result 
FOR EACH LINK OF FORMAT x(X) -> y(Y) 
GENERATE A GET-CLAUSE of FORMAT get_X^Y{X, Y):- SUBCLAUSES I. 
WHERE X AND Y ARE NON-^TERMINALS WITH PARAMETERS ON NON-TERMINALS 

AND SUBCLAUSES IS A UST OF LOWER LEVEL GET.CLAUSES DERIVED FROM PARA- 
30 METER COMBINATIONS OF X AND Y. 

In one interactive processing (manual stage) the result was the following clauses: 

getJ/AR_VARIABLE(var(STRING1), variab(name(STRING1))):- 1. 

35 

get_COMMAND_STATEMENT(assignment(VAR1.EXPRESSION2). . 

asslgnment_stmnt{assignment(VARIABLE1,Y_EXPRESSION2)))> 
getJVAR_VARIABLE(VAR1.VARIABLE1). 
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geLEXPRESSION_EXPRESSION(EXPRESS!ON2,Y_EXPRESSION2). I. 
get_COMMANO_STATEMENT(loop(PROGRAM1,VAR2,EXPRESStON3), 
repeat_STMNT(loop(STATEMENTLIST1 .less_than(EXPRESSION1 ,EXPRESSI0N2)))):- 

get_PROGRAM_STATEMENTUST(PROGRAM1 ,STATEMENTLIST1), 
5 get_VAR_EXPRESSION(VAR2.EXPRESSlON1). 

get_EXPRESSION_EXPRESSION(EXPRESSION3,EXPRESSION2), I. 

geLPROGRAM_STATEMENTLISTa],I]):- 1. 
geLPR0GRAM_STATEMENTLIST(IH1|T1J.(H2|T2D> 
10 get_C0MMAND_STATEMENT(H1, H2), get_PR0GRAM_STATEMENTLIST(T1,T2), I. 

% Debult code PROGRAM -> STATEMENTUST 
get_VAR_EXPRESSION(var(STRING1), name_expr(name(STRING1))):- 1. 
geLEXPRESSION_EXPRESSION(add{EXPRESSION1.EXPRESSION2), 

add(Y_EXPRESSION1,Y_EXPRESSION2)):- 
15 geLEXPRESSI0N_EXPRESSI0N(EXPRESSI0N1 , Y_EXPRESSI0N1), 
get_EXPRESSION_EXPRESSIpN(EXPRESSION2,Y_EXPRESSION2). I. 

get_EXPRESSION_EXPRESSION(value(VAR1).namejaxpr(Y_NAME1)):- 
get_VAR_NAME(VAR1,Y_NAME1), I. 

20 

get_EXPRESSION_EXPRESSION(number(lNTEGER1), lnt(INTEGER1)):-l. 
geLVAR_NAME(var(STRING1), name(STRING1)):- I. 

25 

11) Formal PtesentaBon of tamet l anouaae Ys grammar 12^ Convefsion of descritrtnie lanouaae 

^Prolog) 

PROGRAM = 

30 STATEMENTUST BLOCK -> progFam(STATEMENTUST, BLOCK) 

BLOCK = void main ipar ARGUST rpar COMPOUNDSTMNT 

-> main{ARGLIST, COMPOUNDSTMhn), 
STATEMENTUST -> statemeirtllst(STATEMENTLIST) 

35 

STATEMENTUST = STATEMENT* 



STATEMENT = 

FUNCnON.DERNITION semicolon 
40 LABELED.ST semicolon 



-> ia(FUNCnON_DEFINmON), 
->ls(LABELED_ST). 
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COMPOUNDSTMNT -> cs(COMPOUNDSTMNT), 

EXPRESSION semicolon -> expr(EXPRESSION), 

SELECTIONJST semicolon -> ss(SELECTION_ST), 

ITERAnON_ST semicolon -> is(ITERAnON_ST), 

5 JUMP_ST semicolon ->js(JUMP_ST), 

DECLARATION semicolon ■>> declare(0ECLARATION) 

ITERATION_ST = 

do STATEMENT while Ipar EXPRESSION rpar 
10 ->do(STATEMENT, EXPRESSION), 

while ipar EXPRESSION rpar STATEMENT 

-> while(EXPRESSION. STATEMENT), 
for Ipar EXPRESSIONLIST semicolon EXPRESSIONUST semicolon EXPRESSIONUST ipar 
STATEMENT 

15 -> fntEXPRESSIONUST. EXPRESSIONUST, EXPRESSIONUST. STATEMENT) 

ASSIGNMENT_EXPRESSION = 

UNARY_EXPRESSION ASSIGNMENTjOPERATOR ASSIGNMENT_EXPRKSION 
-> ase(UNARY_EXPRESSION, ASSIGNMENT_OPERATOR. ASSI6NMENT_EXPRESSION). 
CONDITIONAL_EXPRESSION 
20 ' -> ce(CONDmONAL_EXPRESSION), 

generatB(GENEXPRESSION) 

-> generate(GENEXPR^SION) 

GENEXPRESSION - 
25 GENEXPRESSION quesfionm GENEXPRESSION 

-> conditional_expr(GENEXPRESSION. GENEXPRESSION) 
GENEXPRESSION LOG_OP GENEXPRESSION 

-> logical_opein(GENEXPRESSION, LOG_OP, GENEXPRESSION) 
GENEXPRESSION MATH_OP GENEXPRESSION 
30 -> malh_oper(GENEXPRESSION. MATH_OP. GENEXPRESSION) 

GENEXPRESSION OP GENEXPRESSION 

-> relative_oper(GENEXPRESSION.OP,GENEXPRESSION) 
name(STRING) Ipar EXPRESSIONLIST rpar 

-> function_cali(STRING, EXPRESSIONUST) 
35 VAR ->var(VAR). 

CONSTANT ^ const(CONSTANT) 

ASSIGNMENT.OPERATOR = 

OP ->eq(OP). 

40 piusjeq -> plusjeq. 
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minusjeq 



-> minus.eq 



0P = 



eq 
ne 

gt 

it 



->eq, 
->ne, 

->gt. 

->ge, 
->le 



10 



13W lanauaaeHatossarv 

The Y-language glossary is not required, 
15 t>eGause the printout takes place using fbnmat 
clauses. 

The following is a sample of the glossary: 

strJok(";",semlcolon) 

str_tok(".",comma) 
20 strJtokCVold-.void) 

str_tokCmain",main) 

strJok{T.lpar) 

slrjDkn",rpar) 

strJok("ir,lfJ 
25 str_tokfelse''.else) 

slrJokrdo",do) 



str_tok("whlle",while) 

str.tokf^r^.for) 

str_tokr=",eq) 

strJokrr.ll)r) 

slrJokn",rt)r) 

strJok(":",cok)n) 

strJok{"case"»case) 

strJok("default".default) 

strJokrswitch".switch) 

str_tok("goto".goto) 

strjtok("continue'',continue) 

strJokCbreak**.break) 

strjokfretum^retum) 



14) Y-ianauaae datatypes: 

30 PROGRAM =prDgram(STATEMENTUST,BLOCK) 

BLOCK = malnCARGUST.COMPOUNDSTMhJT); 

stlist(STATEMEhJTUST) 



35 MAIN 



= main(DECLARATOR>VRGLIST) 



STATEMENTLIST = STATEMENT* 



STATEMENT = fd(FUNCTION_DERNITION); 
40 ls(LABELED.ST); 
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cs(COMPOUNDSTMNT); 

expr(EXPRESSION); 
SS(SELECT10N_ST); 
is{ITERATION_ST); 
5 js(JUMP_ST); 

cleciare(DECLARATION) 

COMPOUNDSTMNT » stmntnsl(STATEMENTUST) 

10 SELECnON.ST =lf_else(EXPRESSION.STATEMENT.STATEMENT); 
if_then(EXPRESSION.STATEMENT); 
switch(EXPRESSION,STATEMEND 

ITERATIONJST = do(STATEMENT,EXPRESSION); 
15 whHe(EXPRESSION.STATEMENT); 

fbr(EXPRESSIONUST,EXPRESSIONUST.EXPRESSIONUST,STATEMENT) 

ASSIGNMENT.OPERATOR » eq(OP) 

20 ASSIGNMENT_EXPRESSION = 

ase(UNARY_EXPRESSIONASSIGNMENT_OPERATORASSIGNMENT_EXPRESSION): 
ce(CONDmONAL_EXPRESSION): 
^neFate(GENEXPRESSiON) 

25 MATH_OP =plus0; 

minus{); 

mpy(); 

divj) 

30 EXPRESSION =asse(ASSIGNMENT.EXPRESSION); 

ce(CONDmONAL_EXPRESSION); , 
generate(GENEXPRESSION) 

EQUAUTY_EXPRESSION = eq_rel(EQUALITY_EXPRESSION.RELATIONAL.EXPRESSION); 
35 relexp(RELATIONAL_EXPRESSION) 

RELAT10NAL_EXPRESSI0N = lt(REUTIONAL_EXPRESSION,SHIFr_EXPRESSION); 

gt(RELATIONAL_EXPRESSrON,SHIFT.EXPRESSION); 
lt_eq(RELAT10NAL_EXPRESSI0N.SHIFT_EXPRESSI0Nj; 
40 gLeq(RELATIONAL_EXPRESSION,SHIFT_EXPRESSION); 
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sh_exp(SHIFT_EXPRESSION) 

SHIFT_EXPRESSION = lefl(SHIFT_EXPRESSION,ADDmVE_EXPRESSION); 

right(SHIFT_EXPRESSION,ADDmVE_EXPRESSION); 
5 adcl_exp(ADDmVE_EXPRESSION) 



10 



ADDIT1VE_EXPRESSI0N = add(ADDmVE_EXPRESSI0N.MULTIPLICATIVE_EXPRESSION): 
sub(ADDrnVE_EXPRESSION.MULTIPUCAT1VE_EXPRESSION).- 
mutt_exp(MULTIPUCATIVE_EXPRESSION) 

PRIMARY_EXPRESSION = var_expr(VAR); 

const(CONSTANT); 

slr{STRING,CURSOR); 

par_expr(EXPRESSION) 



15 



VAR 



= var(STRING,CURSOR) 



CONSTANT =nuno; 

true_0; 

20 falsej); 

ec(ENUMERATIONjCONSTANT); 

KiNTEGER); 

sttSTRINQ) 



25 

Scanner terms fY language^ 

A scanner is not used in the Y language, 
because reading talces place in the X language. 
30 The following is a sample of the scanner terms. 

PjrOK = semioolonO; 
commaO; 
voidO; 

35 mainO; 

lpar(); 
rparO; 
ifj): 



40 



doO; 



whileO; 

forO: 

eq(); 

m. 

rbrO: 

colonO; 

caseO: 

defaultO; 

switchO; 

gotoO; 

continueO; 

t>real((); 

number(INTE6ER); 

tnje_0; 

falsejO; 
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op(OP): 
niti 

18) Y-lanauaae parsing tooic 

5 

As this is not required in ttie example, it is not siiown. 

18> Y-ianauage generating code 
10 TTie following ^nriple gives the necessary code according to the prindple of sectk^ 
gen_y_STATEIVIENTUST(Dn 

gen_y_STATEiy/IENTLISTttH]. Str):- gen_>'_STATEMEhfr(H, Str), !. 
gen_y_STATEIVIENTLIST(IH|T], Str):- 
15 gen_y_STATEiyflENT(H. St1), gen_y_STATEiyflENTUST(T. St2). forniat(Str."% %", St1.St2). 

gen_y_STATB«ENT(cs(C0MP0UND_ST1) ,Str):- 
gen_y_COI\«POUNpjST(COMPOUND_ST1.Str1). 
SUst=[Str1], 
20 gen^_output(SList, "statement_c^, Str), 1. 

gen_y_STATEMENT(es(EXPRESSION_ST1) ,Str):- 
gen_y_EXPRESSION_ST(EXPRESSION_ST1 .Strl). 
SUst={Str1]. 
25 gen_yjoutput(SU5t, "islatementjes^, Str), I. 

gen_y_STATEMEIMT(is(ITERATION_ST1) ,Str):- 
gen_y_ITERATION_ST(ITERATION_Sri,Str1), 
SUstKStrl]. 
30 gen _y_oiitput(SLJ8t, "statement_is", Str). I. 

g6n_yJTERATION_ST(do(STATEMENT1.EXPRESSiON2) .Str):- 
gen_y_STATEMENT(STATEMENT1 .Strl). 
gen_y_EXPRESSiON(EXPRESSION2,Str2). 
35 SUst=[Sbi,Sti21, 

gei_y_output(SUst, "iterationjstjdo", Str). I. 

gen_y_ASSiGNMENT_0PERAT0R(eq(0P1) .Str):- 
gen_y_OP(OP1.Str1). 
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SIJst=[Str1J. 

gen_y_output(SList, "assignmentjoperatoreq'*, Str), I. 

gen _y_ASSIGNMENT_EXPRESSI0N(ase(UNARY_EXPRESSI0N1 , 
S ASSGNMENT_OPERATOR2^SSIGNMENT_EXPRESSION3) .Str):- 
gen_y_UNARY_EXPRESSION(UNARY_EXPRESSION1.Str1), 
gen_y_ASSIGNMENT_0PERAT0R(ASSIGNMENT_0PERAT0R2,Sti2), 
gen_y_ASSIGNMENT_EXPRESSION(ASSIGNMENT_EXPRESSION3.Sli3). 
SLIst^Str1,Sir2,Str3], 
10 gen_y_output(SList, "assigninentj»(pression_ase", Sb), I. 

gen_y_EXPRESSION(asse(ASSIGNMENT.EXPRESSION1 ) .Str):- 
gen_y_ASSIGNMENT_EXPRESSION(ASSIGNMENT.EXPRESSION1,Str1). 
SList=IStr1]. 

15 gen_y_output(SIJ8t, "expression_asse", Str), I. 

gen_y_VAR(var(STRING1^ .Str):- 
gen_y_STRING{STR»NQ1,Str1), 
SUst=[Str1]. 
20 gen_y_output(Sljsl, "varvai", Str). !. 

gen_y_C0NSTANT(i(INTEGER1) .Str):- 
strJnt(Str1,INTEGER1). 
SLjst=[Sin]. 
25 gen_y_output(SList, "conslant_r, Str), I. 

gen_Y_GENEXPRESSION(math_oper(GENEXPRESSION1,MATH_OP2. 

GENEXPRESSION3).Str):- 
gen_Y_GENEXPRESSION(GENEXPRESSlON1.Str1), 
30 gen_Y_MATH_OP{MATH_OP2,Sti2). 

genJLGENEXPRESSION(GENEXPRESSION3,Str3), 
SList=[Str1,Sti2.Str3], 

genjoutpuUSList. "gen«(pressk)n_math_opei", Str), 
! . • 

35 

gen_Y_GENEXPRESSION(relat>ve_oper(GENEXPRESSION1.0P2,GENEXPRESSION3) ,Str):- 
gen_Y_GENEXPRESSI0N(GENEXPRESSI0N1.Str1), 
gen_Y_OP{OP2.Sti2). 

gen_Y_GENEXPRESSION(GENEXPRESSION3,Str3), 
40 SLIst=IStr1,Sti2,Str31, 
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genjoutput(SList, "genexpression^relativejDper", Str), 
I. 

gen_Y_GENEXPRESSION(var(VAR1) .Str):- 
5 gen_Y.VAR(VAR1.Str1), 
SUst=[Str1], 

gen.output(SLIst, "genexpression^va;", Str), I. 

genjr_GENEXPRESSION(consl<CONSTANT1) ,Str)> 
10 gen_Y_CONSTANT(CONSTANT1.Str1), 
SList=[Str1J. 

gen_output{SLIst, "genexpression.consf*, Str), 1. 
genjr_MATH_OP(plus."+"). 

15 

gen.Y.OPCeq,'^. 

Below are also the formatting operations that are language-independent (general-purpose tool): 

20 genjoutput(Sllst, Fonm, Str):- 
autoJbrm(Form, StrF). 
ft)nmaLloop(Str, StrF, SLIst), I. 
genjoutput(Sll8t. Form. Str):- 
fbn7)at(Txt."\n %:", Form), 
25 gen_slist_Str(SList, Txt Str) J. 
gen_output(SHst Form, Str);- 
not(autoJbrm(Form, J), 
gen.slist_Str(SLjst,V,Str). 

30 19) Y«lanauaae format clauses 

The following are samples of the format clauses: 

auto_fonmC*program_j)rogram",'*% %"). 

auto_fbnfnC'block_main^*Void main ( % ) 

auto_fbmnrbIock_stlisr."%"). 
35 auto Jic>nn("main_main","% main (% )'^. 

autojbrmf compoundstmntj5tmntlisr,"{ % }••). 

auto Jbmri("selecllon_sl_if_else","if ( % ) % else %"). 

auto_fbmfirselectionjst_HLthen",Tf ( % ) %7 

aulDjbrmrseteclton^stjswitch"^^ (%)%"). 
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auto_fbrmriteration_st.do"."do % while ( % )*% 

autoJbrmOteration_stjvhile'\"whiIe ( % ) %"). 

auto_fbrmriteration_sLfor/fbr (%;%;%)%-). 

autq_fbrm("assignment_operator_eq""%"). 
5 aLitoJ6rTn("assignment_expression_ase""% % %"). 

aiitoJbrmf'assignmentjBxpression_ce".''%'0. 

autDjlbrm("assignment_expressionjgenerate""%"). 

autojbrmrgenexpression.mathjoper" "% % 

aulDjbrmf genexpresstonjBlat^^ % %"). 

10 autojbrmrgenexpressk)n_var*,'^"). 

autoJormCgenexpression^consf ,"%-). 

auto_formC*expressk)n_asse'','%'). 

autojbrm("expressk)njce'',"%*'). 

autDjbrmC*expnessionjgenerate",'*%"). 
15 autoJbrmrconstantje)q)ressionjconcr,"%"^ 

autoJit)m)("relationaLexpre8«on_l^ < %"). 

autDjbnnrrelationaL&(pfBssionjgr,**% > %"). 

autojbrmf relatlonaLexpresslonJt_eq"."% <= %"). 

auto_formC*relationaljBxpressk)n jgLeq","% >= %"). 
20 autojbrmf rBlatlonaLexpression_shjBxp","%"). 



wo 02/093371 



PCT/FI02/00411 



42 

Claims 

1. A method for developing a translator, which translator is intended to convert input-language code into 
target-language code, and In which method a descriptive language (V) Is used to formally depict two 

5 source languages that are independent of each other, that is, the said input language (X) and the target 
language (Y), each source language including fonnal master tenns {\Y) and in each master temi there 
being one or several occurrences with possible parameters in these, and in which a program framework 
is formed for the translator, as well as a group of files, which are linked tog^er and translated for the 
selected operating system, 

10 chafacterized In that the said file are formed In the foltowInQ stages: 

- the grammars of both source languages (X and Y) are stored in a selected format in files, in such a 
way that all the occunrences of the master terms of both languages are itemized (stages 1 and 1 

- descriptive language versions (VX and VY) of both source languages (X and Y) ans formed in a 
database, in which descriptive language each occurrence of a term (VX|,VYj) is stated semantrcally, 

15 with the aid of the selected descriptive language term (VJ and the defined tenris of the source 
language, (stages 2 and 2"), 

- the accessory files VX(a-e) and VY(a-e), such as for e)cample, 

a) glossary and scanner terms 

b) datatypes 
20 c) parsing k)gic 

d) generating code 

e) format clauses 

required for the translator are fomned from the descriptive language versk>ns (VX) and (VY) of the input 
and target languages (VX) and (VY) and from the stored grammars of the source languages (X and Y) 
25 (stages 3 and 3*), 

- the interactive connectton of each converied input-language term VX, to the selected target-language 
tenm VY^ is canried out, comprising in steps of (stages 4 and 5): 

- the connectk>n of the master terms to each other 

- the matching of the occunrences to each other, and 

30 - the conversion inslrucBon (VX^VY) of each converted input-language \em (VX,) is stored in a file. 

2. A method according to Claim 1, characterized In that: 

- the input language's parsing k)gic (VX(c)) and 

- the necessary generating code (VY(d)) of the target language and 
35 - the conversbn instructk)ns (VX-^VY) and 

- the necessary format clauses (VY(e)) of the target language, are stored in a database or similar for 
the translator, so that the translator can form, with the aM of the parsing togic (VX(c)) and the code of 
the input language to be translated, a parsing tree (stage 9) including the code in descriptive language 
form and convert the code with the aid of the conversion instructions (VX>^VY) into descriptive 
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language form (stage 10) and generate and format the descriptive language code into target- 
language code (stage 11). with the aid of target-language generation and fbrnnat clauses (VY(d,e)). 

3. A method according to Claims 1 or 2. characterized in that in the interactive connection, an inference 
5 engine is used, which exploits one or more of the following criteria: 

- the Uniting of previously connected occun^ences is proposed, 

- the linking of occunnences/paFameters having the same name, on the basis of descriptive language 
temras is proposed, 

- the linking of parameters on the t>asis of order is proposed. 

10 

4. A method according to any of Claims 1 - 3. characterized in that the interactive connectron is canied 
out using a graphteal interface including at least selectk}n windows for the tenms being proposed, for the 
converskMi instructions t)eing fbnned, as well as at least one pop-up menu window for the selectk>n list 
and in which in each selection window each component acts as a link to the conesponding selectbn list 

15 that appears. 

5. A method according to any of Claims 1-4, characterized in that the PROLOG language Is used as 
the descriptive language and/or the source language of the translator. 

20 6. A system for translating a computer program from a first source language, i.e. the input language (X), 
to a second source language, i.e. the target language (Y), which system includes 

- an input file (7) including several lines of code containing the input-language computer program, 

- a translator (X>Y) connected to the Input file (7), whtoh translator reads the input file and generates 
a translated version of the computer program, and in which the translator includes conversion 

25 instructk>ns (VX>^VY, 41) relating to each input-language term, 

- a translated file (12) connected to the translator (X>Y), for receiving the translated versk>n of the 
computer program thus generate, 

- an operatbn library, containing routines to be called by the translator (X>Y), characterized in that the 
system also Includes: 

30 - a first accessory file (VX(c), 31c) containing the input-language parsing logte for the selected 
semantto descriptive language (V), 

- a second accessory file (P(Y(d,e), 42), containing the source-language generating and fonmat 
clauses; 

in which case the translator (X>Y) is anranged: 
35 - to convert the computer program's lines of code first of all into descriptive language form, using ttie 
parsing logic of the first accessory file VX(c), (stage 9); and 

- then to convert them in descriptive language form, using the saM conversion instmction (VX>^VY,41), 
(stage 10) and 

- to generate and fonnat the descriptive language code into fomnal target-language cQde (stage 1 1), 
40 using the target-language generating code (VY(d), 42) and fomnat dauses (VY(e), 42). 
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7. A system according to Claim 6, characterized in that the target language (Y) is a selected-form 
documentation format for the automatic documentation of the input-language (X) program, for example, 
one of the following formats: module listing, variable list, cross-reference table, graphical hierarchic 
5 diagram, data-flow diagram. 
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